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Lepidopteran-Active Bacillus thuringiensis 
5-Endotoxin Compositions and Methods of Use 

1.0 Background of the Invention 
s 1.1 Field of the Invention 

The present invention relates generally to the fields of molecular biology. More 
particularly, certain embodiments concern methods and compositions comprising DNA 
segments, and proteins derived from; bacterial species. More particularly, it concerns novel 
genes from Bacillus thuringKmvs ^lip^in^ lepidopteran-toxic crystal proteins. Various methods j 

10 for making and using these pN^ segmi^^ DNA segments encoding' synthetically-modified Cry 
proteins, and native an^d s'3^thetic'ci^^ such as, for example, the use of 

DNA segments as diaghoistic ' prbbes and templates for protein production, and the use of 
proteins, fusion protein carriers aiid ^ peptides in various immunological and diagnostic 
applications. Also disclosed -are 'methods of making and using nucleic acid segments in the 

IS development of transgenic plant cells containing the DNA segments disclosed herein. 

1.2 Description of the Related Art 

Almost all field crops, plants, and commercial farming areas are susceptible to attack by 
one or more insect pests. Particularly problematic are Coleopteran and Lepidoptem pests. For 
example, vegetable and cole crops such as artichokes, kohlrabi, arugula, leeks, asparagus, lentils, 

20 beans, lettuce (e.g.. head, leaf, romaine), beets, bok choy, malanga, broccoli, melons {e.g., ^ 
muskmelon, watermelon, crenshaw, honeydew,cantaloupe), brussels sprouts, cabbage, cardoni, 
carrots, napa, cauliflower, okra, onions, celery, parsley, chick peas, parsnips, chicory, peas, 
Chinese cabbage, peppers, collards, potatoes, cucumber, pumpkins, cucurbits, radishes, dry bulb 
onions, rutabaga, eggplant, salsify, escarole, shallots, endive, soybean, garlic, spinach, green 

25 onions, squash, greens, sugar beets, sweet potatoes, turnip, swiss chard, horseradish, tomatoes, 
kale, turnips, and a variety of spices are sensitive to infestation by one or more of the following 
insect pests: alfalfa looper, armyworm, beet armyworm, artichoke plume moth, cabbage 
budworm, cabbage looper, cabbage webworm, com earworm, celery leafeater, cross-striped 
cabbageworm, european com borer, diamondback moth, green cloverworm, imported 

30 cabbageworm, melonworm, omnivorous leafroller, pickleworm, rindworm complex, saltmarsh 
caterpillar, soybean looper, tobacco budworm, tomato fmitworm, tomato homworm, tomato 
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pinworm, velvetbean caterpillar, and yellowstriped armyworm. Likewise, pasture and hay crops 
such as alfalfa, pasture grasses and silage are often attacked by such pests as armyworm, beef 
armyworm, alfalfa caterpillar, European skipper, a variety of loopers and webworms, as well as 
yellowstriped armyworms. 

5 Fruit and vine crops such as apples, apricots, cherries, nectarines, peaches, pears, plums, 

prunes, quince almonds, chestnuts, filberts, pecans, pistachios, walnuts, citrus, blackberries, 
blueberries, boysenberries, cranberries, currants, loganberries, raspberries, strawberries, grapes, 
avocados, bananas, kiwi, persimmons, pomegranate, pineapple, tropical fruits are often 
susceptible to attack and defoliation by achema sphinx moth, amorbia, armyworm, citrus 

10 cutworm, banana skipper, blackheaded fireworm, blueberry leafroller, cankerworm, cherry 
fruitworm, citrus cutworm, cranberry girdler, eastern tent caterpillar, fall webwoim, fall 
webworm, filbert leafroller, filbert webworm, fruit tree leafroller, grape berry moth, grape 
leaffolder, grapeleaf skeletonizer, green fruitworm, gummosos-batrachedra commosae, gypsy 
moth, hickory shuckworm, homwonns, loopers, navel orangeworm, obliquebanded leafroller, 

15 omnivorous leafroller. omnivorous looper, orange tortrix, orangedog, oriental fruit moth, 
pandemls leafroller, peach twig borer, pecan nut casebearer, redbanded leafroller, redhumped 
caterpillar, roughskinned cutworm, saltmarsh caterpillar, spanworm, tent caterpillar, thecla- 
thecla basillides, tobacco budworm, tortrix moth, tufted apple budmoth, variegated leafroller, 
walnut caterpillar, western tent caterpillar, and yellovrstriped armyworm. 

20 Field crops such as canola/rape seed, evening primrose, meadow foam, com (field, sweet, 

popcom), cotton, hops, jojoba, peanuts, rice, safflower, small grains (barley, oats, rye, wheat, 
etc.), sorghum, soybeans, sunflowers, and tobacco are often targets for infestation by insects 
including armyworm, asian and other com borers, banded sunflower moth, beet armywomfi, 
bollworm, cabbage looper, com rootworm (including southern and western varieties), cotton leaf 

25 perforator, diamondback moth, european com borer, green cloverworm, headmoth, headworm, 
imported cabbageworm, loopers (including Anacamptodes spp.), obliquebanded leafroller, 
omnivorous leaftier, podworm, podworm, saltmarsh caterpillar, southwestem com borer, 
soybean looper, spotted cutworm, sunflower moth, tobacco budworm, tobacco homworm, 
velvetbean caterpillar. 

30 Bedding plants, flowers, ornamentals, vegetables and container stock are frequently fed 

upon by a host of insect pests such as armyworm, azalea motii, beet armyworm, diamondback 
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moth, ello moth (homworm), Florida fern caterpillar, lo moth, loopers, oleander moth, 
omnivorous leafroller, omnivorous looper, and tobacco budworm. 

Forests, fruit, ornamental, and nut-bearing trees, as well as shrubs and other nursery stock 
are often susceptible to attack from diverse insects such as bagworm, blackheaded budworm, 

5 browntail moth, California oakworm, douglas fir tussock moth, elm spanworm, fall webworm, 
fiuittree leafroller, greenstriped mapleworm, gypsy moth, jack pine budworm, mimosa 
webworm, pine butterfly, redhumped caterpillar, saddleback caterpillar, saddle prominent 
caterpillar, spring and fall cankerworm, spruce budworm, tent caterpillar, tortrix, and western 
tussock moth. Likewise, turf grasses are often attacked by pests such as armyworm, sod 

10 webworm, and tropical sod webworm. 

Because crops of conunercial interest are often the target of insect attack, 
environmentally-sensitive methods for controlling or eradicating insect infestation are desirable 
in many instances. This is particularly true for farmers, nurserymen, growers, and commercial 
and residential areas which seek to control insect populations using eco-friendly compositions. 

IS The most widely used environmentally-sensitive insecticidal formulations developed in 

recent years have been composed of microbial pesticides derived from the bacterium Bacillus 
thuringiensis. B. thuringiensis is a Gram-positive bacterium that produces crystal proteins or 
inclusion bodies which are specifically toxic to certain orders and species of insects. Many 
different strains of B. thuringiensis have been shown to produce insecticidal crystal proteins. 

20 Compositions including B. thuringiensis strains which produce insecticidal proteins have been 
commercially-available and used as environmentally-acceptable insecticides because they are 
quite toxic to the specific target insect, but are harmless to plants and other non-targeted 
organisms. 

1.2.1 B. thuringiensis Crystal Proteins S-Endotoxins 

25 §-endotoxins are used to control a wide range of leaf-eating caterpillars and beetles, as 

well as mosquitoes. These proteinaceous parasporal crystals, also referred to as insecticidal 
crystal proteins, crystal proteins, Bt inclusions, crystaline inclusions, inclusion bodies, and Bt 
toxins, are a large collection of insecticidal proteins produced by B. thuringiensis that are toxic 
upon ingestion by a susceptible insect host. Over the past decade research on the structure and 

30 function of B. thuringiensis toxins has covered all of the major toxin categories, and while these 
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toxins differ in specific structure and function, general similarities in the structure and function 
are assumed. Based on the accumulated knowledge of B. thuringiensis toxins, a generalized 
mode of action for B. thuringiensis toxins has been created and includes: ingestion by the insect, 
solubilization in the insect midgut (a combination stomach and small intestine), resistance to 
5 digestive enzymes sometimes with partial digestion actually "activating" the toxin, binding to the 
midgut cells, formation of a pore in the insect cells and the disruption of cellular homeostasis 

(English and Slatin, 1992). 

One of the unique features oi B. thuringiensis is its production of crystal proteins during 
sporulation which are specifically toxic to certain orders and species of insects. Many different 

to strains of B. thuringiensis have been shown to produce insecticidal crystal proteins. 
Compositions including B. thuringiensis strains which produce proteins having insecticidal 
activity against lepidopteran and dipteran insects have been commercially available and used as 
environmentally-acceptable insecticides because they are quite toxic to the specific target insect, 
but are harmless to plants and other non-targeted organisms. 

13 The mechanism of insecticidal activity of the A thuringiensis crystal proteins has been 

studied extensively in the past decade. It has been shown that the crystal proteins are toxic to the 
insect only after ingestion of the protein by the insect. The alkaline pH and proteolytic enzymes 
in the insect mid-gut solubilize the proteins, thereby allowing the release of components which 
are toxic to the insect. These toxic components disrupt the mid-gut cells, cause the insect to 

20 cease feeding, and, eventually, bring about insect death. For this reason, B. thuringiensis has 
proven to be an effective and environmentally safe insecticide in dealing with various insect 
pests. 

As noted by Hdfte and Whiteley (1989), the majority of insecticidal B. thuringiensis 
strains are active against insects of the order Lepidoptera, caterpillar insects. Other B. 
25 thuringiensis strains are insecticidally active against insects of the order Diptera, i.e., flies and 
mosquitoes, or against both lepidopteran and dipteran msects. In recent years, a few B. 
thuringiensis strains have been reported as producing crystal proteins that are toxic to insects of 
the order Coleoptera, i.e., beetles (Krieg et al., 1983; Sick et ai, 1990; Donovan et al., 1992; 
Lambert et al, 1992a; 1992b). 



30 
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1.2.2 Genes Encoding Crystal Proteins 

Many of the 5-endotoxins are related to various degrees by similarities in their amino 
acid sequences. Historically, the proteins and the genes which encode them were classified 
based largely upon their spectrum of insecticidal activity. The review by Hofte and Whiteley 

s (1 989) discusses the genes and proteins that were identified in B. thuringiensis prior to 1990, and 
sets forth the nomenclature and classification scheme which has traditionally been applied to B. 
thuringiensis genes and proteins, cryj genes encode lepidopteran-toxic Cryl proteins. cryJI 
genes encode Cryll proteins that are toxic to both lepidopterans and dipterans. crylll genes 
encode coleopteran-toxic Crylll proteins, while crylV genes encode dipteran-toxic CrylV 

10 proteins. Based on the degree of sequence similarity, the proteins were fiirther classified into 
subfamilies; more highly related proteins within each family were assigned divisional letters 
such as CrylA, CrylB, CrylC, etc. Even more closely related proteins within each division were 
given neimes such as CrylCl , CryIC2, etc. 

Recently, a new nomenclature was developed which systematically classified the Cry 

15 proteins based upon amino acid sequence homology rather than upon insect target specificities 
(Crickmore ei ai, 1998). The classification scheme for many known toxins, including allelic 
variations in individual proteins, is summarized and regularly updated at 
http://www.biols.susx.ac.uk/Home/Neil_Crickmore/Bt/. The informationw was most recently 
updated as of April 27, 1999 and is herein incorporated by reference. 

20 1.2.3 Crystal Proteins Toxic to Lepidopteran Insects 

2.0 Summary of the Invention 

The recent review by Schnepf et al. (1998) describes the enormous diversity of 
insecticidal crystal proteins derived from B. thuringiensis. Cry proteins of the Cryl, Cry2, and 
Cry9 classes are particularly known for their toxicity towards lepidopteran larvae, however, the 

2S degree of toxicity varies significantly depending on the target lepidopteran pest (Hdfte and 
Whiteley, 1989). For instance, Cryl Ac shows poor toxicity towards the armyworm, Spodoptera 
littoralis, but strong toxicity towards the tobacco budworm, Heliothis virescens. In addition, 
slight variations in amino acid sequence within a Cry protein class can dramatically impact 
insecticidal activity (see Schnepf et al, 1998 and references therein). The Cry3Ba and Cry3Bb 

30 genes, for instance, share 94% amino acid sequence identity, but only Cry3Bb exhibits 
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significant toxicity towards the Southern com rootwom, Diabrotica undecimpmctata howardi 
(Donovan et ai. 1992). Similarly, Cry2Aa and Cry2Ab share 87% amino acid sequence identity, 
yet only Cry2Aa displays toxicity towards mosquitos (Widner and Whiteley, 1990). Von 
Tersch et al. (1991) demonstrated that Cry 1 Ac proteins varying by only seven amino acids 

5 (>99% sequence identity) nevertheless show significant differences in insecticidal activity. Lee 
et al. (1996) reported that CrylAb alleles differing at only two amino acid positions exhibited a 
10-fold difference in toxicity towards the gypsy moth, Lymantria dispar. Thus, even Cry 
proteins that are considered to be alleles of known Cry proteins or to belong to a Cry protein 
subclass (Crickmore et al., 1998) may have unique and useful insecticidal properties. 

10 International Patent Application Publication No. WO 98/00546 and WO 98/40490 describe a 
variety of Cryl-, Cry2-, and Cry9-related crystal proteins obtained from B. thuringiensis. 

2.1 Cry DNA Segments 

The present invention concerns nucleic acid segments, that can be isolated from virtually 
any source, that are free from total genomic DNA and that encode the novel peptides disclosed 

15 herein. Nucleic acid segments encoding these polypeptides may encode active proteins, peptides 
or peptide firagments, polypeptide subunits, functional domains, or the like of one or more crystal 
proteins. In addition the invention encompasses nucleic acid segments which may be 
synthesized entirely in vitro using methods that are well-known to those of skill in the art which 
encode the novel Cry polypeptides, peptides, peptide fragments, subunits, or functional domains 

20 disclosed herein. 

As used herein, the term "nucleic acid segment" refers to a polynucleotide molecule that 
has been isolated free of total genomic DNA of a particular species. Therefore, a nucleic acid 
segment encoding an endotoxin polypeptide refers to a nucleic acid segment that comprises one 
or more crystal protein-encoding sequences yet is isolated away from, or purified free from, total 

25 genomic DNA of the species from which the nucleic acid segment is obtained, which in the 
instant case is the genome of the Gram-positive bacterial genus, Bacillus, and in particular, the 
species of Bacillus known as B. thuringiensis. Included within the term "nucleic acid segment", 
are polynucleotide segments and smaller fragments of such segments, and also recombinant 
vectors, including, for example, plasmids, cosmids, phagemids, phages, vuuses, and the like. 

30 Similarly, a DNA segment comprising an isolated or purified crystal protein-encoding 

gene refers to a DNA segment which may include in addition to peptide encoding sequences. 
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certain other elements such as, regulatory sequences, isolated substantially away from other 
naturally occurring genes or protein-encoding sequences. In this respect, the term "gene" is used 
for simplicity to refer to a functional protein-, polypeptide- or peptide-encoding unit. As will be 
understood by those in the art, this functional term includes both genomic sequences, operon 
5 sequences and smaller engineered gene segments that express, or may be adapted to express, 
proteins, polypeptides or peptides. Also, the term includes an expression cassette comprising at 
least a promoter operably linked to one or more protein coding sequences, operably linked to at 
least a transcriptional termination sequence. 

"Isolated substantially away from other coding: sequences" means that the gene of 

10 interest, in this case, a nucleic acid segment or gene encoding all or part of a bacterial 
insecticidal crystal proteinj fdrms the significant part of the coding region of the DNA segment, 
and that the DNA segment does not contain large portions of naturally-occurring coding DNA, 
such as large chromosomal fragments or other functional nucleic acid segments or genes or 
operon coding regions. Of course, this refers to the DNA segment as originally isolated, and 

IS does not exclude genes, recombinant genes, synthetic linkers, or coding regions later added to 
the segment by the hand of man. 

In particular embodiments, the invention concerns isolated DNA segments and 
recombinant vectors incorporating DNA sequences that encode a Gry peptide species that 
includes within its amino acid sequence an amino acid sequence essentially as set forth in SEQ 

20 ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ 
ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20. SEQ ID NO:22, SEQ ID NO:24, 
SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID 
NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ 
ID NO:48, SEQ ID NO:50 and SEQ ID NO. 63. 

25 The term "a sequence essentially as set forth in SEQ ID NO:2, SEQ ID NO:4, or SEQ ID 

N0:6," for example, means that the sequence substantially corresponds to a portion of the 
sequence of SEQ ID N0:2, SEQ ID NO:4, or SEQ ID N0:6 and has relatively few amino acids 
that are not identitical with, or a biologically functional equivalent of, the amino acids of any of 
these sequences. The term "biologically functional equivalent" is well understood in the art and 

30 is further defined in detail herein {e.g., see Illustrative Embodiments). Accordingly, sequences 
that have from about 70% to about 80%, or more preferably about 81, 82, 83, 84, 85, 86, 87, 88, 
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89, or about 90%, or even more preferably about 91, 92, 93, 94, 95, 96, 97, 98, or about 99% 
amino acid sequence identity or functional equivalence to the amino acids of SEQ ID N0:2, SEQ 
ID NO:4, SEQ ID N0:6, SEQ ID N0:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO: 14, SEQ 
ID N0:16, SEQ ID N0:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, 

5 SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID 
NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ 
ID NO:50 and SEQ ID NO. 63 will be sequences that are "essentially as set forth in SEQ ID 
NO:2, SEQ ID N0:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID 
N0:14, SEQ ID N0:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ 

10 ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, 
SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID 
NO:48, SEQ ID NO:50 and SEQ ID NO: 63." 

In addition, sequences that have from about 70% to about 80%, or more preferably about 
81, 82, 83, 84, 85, 86, 87, 88, 89, or about 90%, or even more preferably about 91, 92, 93, 94, 

IS 95, 96, 97, 98, or about 99% nucleic acid sequence identity or functional equivalence to the 
nucleic acids of SEQ ID NO:l, SEQ ID N0:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, 
SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID 
NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ 
ID NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID N0:41, SEQ ID NO:43, 

20 SEQ ID N0:45, SEQ ID NO:47, SEQ ID NO:49 and SEQ ID NO:62 will be sequences that are 
"essentially as set forth in SEQ ID NO:l, SEQ ID NO:3, SEQ ID N0:5, SEQ ID NO:7, SEQ ID 
NO:9, SEQ ID N0:11, SEQ ID NO:13, SEQ ID N0:15, SEQ ID NO:17, SEQ ID NO:19, SEQ 
ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID N0:31, 
SEQ ID N0:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:41. SEQ ID 

25 NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49 and SEQ ID NO:62." 

It will also be understood that amino acid and nucleic acid sequences may include 
additional residues, such as additional N- or C-terminal amino acids or 5' or 3' sequences, and 
yet still be essentially as set forth in one of the sequences disclosed herein, so long as the 
sequence meets the criteria set forth above, including the maintenance of biological protein 

30 activity where protein expression is concerned. The addition of terminal sequences particularly 
applies to nucleic acid sequences that may, for example, include various non-coding sequences 
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flanking either of the 5' or 3' portions of the coding region or may include various internal 
sequences, i.e., introris, which are known to occur within genes. 

The nucleic acid segments of the present invention, regardless of the length of the coding 
sequence itself, may be combined with other DNA sequences, such as promoters, 
5 polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other coding 
segments, and the like, such that their overall length may vary considerably. It is therefore 
contemplated that a nucleic acid fragment of almost any length may be employed, with the total 
length preferably being limited by the ease of preparation and use in the intended recombinant 
DNA protocol. For example, nucleic acid fragments may be prepared that include a short 1 

10 , contiguous stretch encoding any of the peptide sequences disclosed in SEQ ID NO:2, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID 
NO: 1 6, SEQ ID NO: 1 8, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ 
ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, 
SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID 

15 NO:50 and SEQ ID NO: 63, or that are identical with or complementary to DNA sequences 
which encode any of the peptides disclosed in SEQ ID NO:2, SEQ ID NO:4, SEQ ID N0:6, 
SEQ ID NO:8, SEQ ID NOrlO, SEQ ID N0:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID 
NO: 1 8, SEQ ID NO:20, SEQ ID N0:22, SEQ ID NO:24. SEQ ID NO:26, SEQ ID NO:28, SEQ 
ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, 

20 SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50 and SEQ ID ^ 
NO: 63, and particularly those DNA segments disclosed in SEQ ID NO:l, SEQ ID NO:3, SEQ 
ID NO:5, SEQ ID NO:7, SEQ ID N0:9, SEQ ID N0:11, SEQ ID N0:13, SEQ ID NO:15, SEQ 
ID NO: 17. SEQ ID NO: 19, SEQ ID N0:21, SEQ ID NO:23. SEQ ID NO:25, SEQ ID NO:27, 
SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID 

25 NO:39, SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49 and 
SEQ ID NO: 62. For example, DNA sequences such as about 18 nucleotides, and that are up to 
about 10,000, about 5,000, about 3,000, about 2,000, about 1,000, about 500, about 200, about 
100, about 50, and about 14 base pairs in length (including all intermediate lengths) are also 
contemplated to be useful. 

30 It will be readily understood that "intermediate lengths", in these contexts, means any 

length between the quoted ranges, such as 18, 19, 20, 21, 22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 
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ss, etc.; 100, 101, 102, 103, etc.; 150, 151, 152, 153, etc.; including all integers in the ranges of 
from about 200-500; 500-1,000; 1,000-2,000; 2,000-3,000; 3,000-5,000; and up to and including 
sequences of about 1 0,00 or so nucleotides and the like. 

It will also be understood that this invention is not limited to the particular nucleic acid 

5 sequences which encode peptides of the present invention, or which encode the amino acid 
sequences of SEQ ID NO:2, SEQ ID N0:4, SEQ ID N0:6, SEQ ID N0:8, SEQ ID NO:10, SEQ 
ID N0:12, SEQ ID N0:14, SEQ ID N0:16, SEQ ID N0:18, SEQ ID NO:20, SEQ ID NO:22, 
SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID 
NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44. SEQ 

10 ID NO:46, SEQ ID NO:48, SEQ ID NO:50 and SEQ ID NO: 63, including those DNA 
sequences which are particularly disclosed in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ 
ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ ID NO:13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ 
ID N0:19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, 
SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:35. SEQ ID NO:37, SEQ ID NO:39, SEQ ID 

,5 N0:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49 and SEQ ID NO:62. 
Recombinant vectors and isolated DNA segments may therefore variously include the peptide- 
coding regions themselves, coding regions bearing selected alterations or modifications in the 
basic coding region, or they may encode larger polypeptides that nevertheless include these 
peptide-coding regions or may encode biologically functional equivalent proteins or peptides that 

20 have variant amino acids sequences. 

The DNA segments of the present invention encompass biologically-functional, 
equivalent peptides. Such sequences may arise as a consequence of codon degeneracy and 
functional equivalency that are known to occur naturally within nucleic acid sequences and the 
proteins thus encoded. Alternatively, functionally-equivalent proteins or peptides may be 

25 created via the application of recombinant DNA technology, in which changes in the protein 
structure may be engineered, based on considerations of the properties of the amino acids being 
exchanged. Changes designed by man may be introduced through the application of site-directed 
mutagenesis techniques, e.g., to introduce improvements to the antigenicity of the protein or to 
test mutants in order to examine activity at the molecular level. 

30 If desired, one may also prepare fusion proteins and peptides, e.g., where the peptide- 

coding regions are aligned within the same expression unit with other proteins or peptides having 
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desired functions, such as for purification or immunodetection purposes (e.g., proteins that may 
be purified by affinity chromatography and enzyme label coding regions, respectively). 

Recombinant vectors form further aspects of the present invention. Particularly useful 
vectors are contemplated to be those vectors in which the coding portion of the DNA segment, 
5 whether encoding a full length protein or smaller peptide, is positioned under the control of a 
promoter. The promoter may be in the form of the promoter that is naturally associated with a 
gene encoding peptides of the present invention, as may be obtained by isolating the 5' non- 
coding sequences located upstream of the coding segment or exon, for example, using 

recombinant cloning and/or PGR™ technology, in connection with the compositions disclosed 

I 

10 herein. 

2.2 Cry DNA Segments as Hybridization Probes And Primers 

In addition to their use in directing the expression of crystal proteins or peptides of the 
present invention, the nucleic acid sequences contemplated herein also have a variety of other 
uses. For example, they also have utility as probes or primers in nucleic acid hybridization 

15 embodiments. As such, it is contemplated that nucleic acid segments that comprise a sequence 
region that consists of at least a 14 nucleotide long contiguous sequence that has the same 
sequence as, or is complementary to, a 14 nucleotide long contiguous DNA segment of SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID 
NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID N0:21, SEQ ID N0.23, SEQ 

20 ID NO:25, SEQ ID N0:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:35, 
SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID 
NO:47, SEQ ID NO:49 and SEQ ID NO:62 will find particular utility. Longer contiguous 
identical or complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000, 
2000, 5000 bp, etc. (including all intermediate lengths and up to and including the full-length 

25 gene sequences encoding each polypeptide will also be of use in certain embodiments. 

The ability of such nucleic acid probes to specifically hybridize to crystal protein- 
encoding sequences will enable them to be of use in detecting the presence of complementary 
sequences in a given sample. However, other uses are envisioned, including the use of the 
sequence information for the preparation of mutant species primers, or primers for use in 

30 preparing other genetic constructions. 
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Nucleic acid molecules having sequence regions consisting of contiguous nucleotide 
stretches of about 14 to about 17 or so, 18-25, 26-35, 36-50, or even up to and including 
sequences of about 100-200 nucleotides or so, identical or complementary to DNA sequences of 
SEQ ID N0:1, SEQ ID N0:3, SEQ ID NO:5, SEQ ID N0:7, SEQ ID N0:9, SEQ ID N0:11, 

5 SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID N0:21, SEQ ID 
NO:23, SEQ ID NO:25, SEQ ID N0:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ 
ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID N0:41, SEQ ID NO:43, SEQ ID NO:45, 
SEQ ID NO:47, SEQ ID NO:49 and SEQ ID NO:62, are particularly contemplated as 
hybridization probes for use in, e.g.. Southern and -Northern blotting. Smaller fragments will 

10 generally find use in hybridization embodiments, wherein the length of the contiguous 
complementary region may be varied, such as bet>yeen about 10-14 and about 100 to 200 or so 
nucleotides, but larger contiguous complementarity stretches may be used, according to the 
length complementary sequences one wishes to detect. 

Of course, fragments may also, be obtained by other techniques such as, e.g., by 

15 mechanical shearing or by restriction enzyme digestion. Small nucleic acid segments or 
fragments may be readily prepared by, for example, directly synthesizing the fragment by 
chemical means, as is commonly practiced using an automated oligonucleotide synthesizer. 
Also, fragments may be obtained by application of nucleic acid reproduction technology, such as 
the PGR™ technology of U. S. Patents 4,683,195 and 4,683,202 (each incorporated herein by 

20 reference), by introducing selected sequences into recombinant vectors for recombinant 
production, and by other recombinant DNA techniques generally known to those of skill in the 
art of molecular biology. 

Accordingly, the nucleotide sequences of the invention may be used for their ability to 
selectively form duplex molecules with complementary stretches of DNA fragments. Depending 

25 on the application envisioned, one will desire to employ varying conditions of hybridization to 
achieve varying degrees of selectivity of probe towards target sequence. For applications 
requiring high selectivity, one will typically desire to employ relatively stringent conditions to 
form the hybrids, e.g., one will select relatively low salt and/or high temperature conditions, such 
as provided by about 0.02 M to about 0.15 M NaCl at temperatures of about 50°C to about 70°C. 

30 Such selective conditions tolerate little, if any, mismatch between the probe and the template or 
target strand, and would be particularly suitable for isolating crystal protein-encoding DNA 



BNSDOCID: <WO_01198S9A2J_> 



wo 01/19859 



PCT/USOO/25361 



- 13- 

segments. Detection of DNA segments via hybridization is well-known to those of skill in the 
art, and the teachings of U. S. Patents 4,965,188 and 5,176,995 (each incorporated herein by 
reference) are exemplary of the methods of hybridization analyses. Teachings such as those 
found in the texts of Maloy et ai, 1990; Maloy 1994; Segal, 1976; Prokop, 1991; and Kuby, 
5 1991, are particularly relevant. 

Of course, for some applications, for example, where one desires to prepare mutants 
employing a mutant primer strand hybridized to an underlying template or where one seeks to 
isolate crystal protein-encoding sequences from related species, functional equivalents, or the 
like, less stringent hybridization conditions will typically be needed in order to allow formation 

10 of the heteroduplex. In these circumstances, one may desire to employ conditions such as about 
0.15 M to about 0.9 M salt, at temperatures ranging from about 20°C to about 55°C. Cross- 
hybridizing species can thereby be readily identified as positively hybridi2dng signals with 
respect to control hybridizations. In any case, it is generally appreciated that conditions can be 
rendered more stringent by the addition of increasing amounts of formamide, which serves to 

IS destabilize the hybrid duplex in the same manner as increased temperature. Thus, hybridization 
conditions can be readily manipulated, and thus will generally be a method of choice depending 
on the desired results. 

In certain embodiments, it will be advantageous to employ nucleic acid sequences of the 
present invention in combination with an appropriate means, such as a label, for determining 

20 hybridization. A wide variety of appropriate indicator means are known in the art, including ^ 
fluorescent, radioactive, enzymatic or other ligands, such as avidin/biotin, which are capable of 
giving a detectable signal. In preferred embodiments, one will likely desire to employ a 
fluorescent label or an enzyme tag, such as urease, alkaline phosphatase or peroxidase, instead of 
radioactive or other enviroiunentally undesirable reagents. In the case of enzyme tiags, 

25 colorimetric indicator substrates are known that can be employed to provide a means visible to 
the human eye or spectrophotometrically, to identify specific hybridization with complementary 
nucleic acid-containing samples. 

In general, it is envisioned that the hybridization probes described herein will be useful 
both as reagents in solution hybridization as well as in embodiments employing a solid phase. In 

30 embodiments involving a solid phase, the test DNA (or RNA) is adsorbed or otherwise affixed to 
a selected matrix or surface. This fixed, single-stranded nucleic acid is then subjected to specific 
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hybridization with selected probes under desired conditions. The selected conditions will depend 
on the particular circumstances based on the particular criteria required (depending, for example, 
on the G+C content, type of target nucleic acid, source of nucleic acid, size of hybridization 
probe, etc.). Following washing of the hybridized surface so as to remove nonspecifically bound 
probe molecules, specific hybridization is detected, or even quantitated, by means of the label. 

2.3 Vectors and Methods for Recombinant Expression of Cry Polypeptides 

In other embodiments, it is contemplated that certain advantages will be gained by 
positioning the coding DNA segment under the control of a recombinant, or heterologous, 
promoter. As used herein, a recombinant or heterologous pron^oter is intended to refer to a 
promoter that is not normally associated with a DNA segment encoding a crystal protein or 
peptide in its natural environment. Such promoters may include promoters normally associated 
with other genes, and/or promoters isolated from any bacterial, viral, eukaryotic, or plant cell. 
Naturally, it will be important to employ a promoter that effectively directs the expression of the 
DNA segment in the cell type, organism, or even animal, chosen for expression. The use of 
promoter and cell type combinations for protein expression is generally known to those of skill 
in the art of molecular biology, for example, see Sambrook et al., 1989. The promoters 
employed may be constitutive, or inducible, and can be used under the appropriate conditions to 
direct high level expression of the introduced DNA segment, such as is advantageous in the 
large-scale production of recombinant proteins or peptides. Appropriate promoter systems 
contemplated for use in high-level expression include, but are not limited to, the Pichia 
expression vector system (Pharmacia LKB Biotechnology). 

In connection with expression embodiments to prepare recombinant proteins and 
peptides, it is contemplated that longer DNA segments will most often be used, with DNA 
segments encoding the entire peptide sequence being most preferred. However, it will be 
appreciated that the use of shorter DNA segments to direct the expression of crystal peptides or 
epitopic core regions, such as may be used to generate anti-crystal protein antibodies, also falls 
within the scope of the invention. DNA segments that encode peptide antigens from about 8 to 
about 50 amino acids in length, or more preferably, from about 8 to about 30 amino acids in 
length, or even more preferably, from about 8 to about 20 amino acids in length are contemplated 
to be particularly useful. Such peptide epitopes may be amino acid sequences which comprise 
contiguous amino acid sequences from SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID 
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NO:8, SEQ ID NO:10. SEQ ID N0:12, SEQ ID N0:14, SEQ ID N0:16, SEQ ID NO:18, SEQ 
ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, 
SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID 
NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50 and SEQ ID NO: 63. 

s 2.4 cry Transgenes and Transgenic Plants Expressing Cry Polypeptides 

In yet another aspect, the present invention provides methods for producing a transgenic 
plant which expresses a nucleic acid segment encoding the novel polypeptides and endotoxins of 
the present invention. The process of producing transgenic plants is well-known in the art. In 
general, the method comprises transforming a suitable host cell with a DNA segment which 

10 contains a promoter operatively linked to a coding region that encodes one or more CryET31, 
CryET40, CryET43, CryET44, CryET45, CryET46, CryET47, CryET49, CiyETSl, CryET52, 
CryET53, CiyET54, CiyET55, CryET56, CryET57, CryET59, CryET60. CiyET61, CryET62, 
CryET63, CryET64, CryET66, CryET67, CryET68, CryET72, CryET73, and CryET83 
polypeptides. Such a coding region is generally operatively linked to a transcription-terminating 

15 region, whereby the promoter is capable of driving the transcription of the coding region in the 
cell, and hence providing the cell the ability to produce the polypeptide in vivo. Alternatively, in 
instances where it is desirable to control, regulate, or decrease the amount of a particular 
recombinant crystal protein expressed in a particular transgenic cell, the invention also provides 
for the expression of crystal protein antisense mRNA. The use of antisense mRNA as a means of 

20 controlling or decreasing the amount of a given protein of interest in a cell is well-known in the 
art. 

Another aspect of the invention comprises transgenic plants which express a gene or gene 
segment encoding one or more of the novel polypeptide compositions disclosed herein. As used 
herein, the term "transgenic plant" is intended to refer to a plant that has incorporated DNA 

25 sequences, including but not limited to genes which are perhaps not normally present, DNA 
sequences not normally transcribed into RNA or translated into a protein ("expressed"), or any 
other genes or DNA sequences which one desires to introduce into the non-transformed plant, 
such as genes which may normally be present in the non-transformed plant but which one desires 
to either genetically engineer or to have altered expression. 

30 It is contemplated that in some instances either the nuclear or plastidic genome, or both,' 

of a transgenic plant of the present invention will have been augmented through the stable 
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introduction of one or more cryETSl, cryET40. cryET43, cryET44. cryET45, cryET46. cryET47. 
cryET49. cryETSl. cryET52. cryET53, cryET54, cryETSS, cryET56, cryET57. cryET59. 
cryET60, cryET61. cryET62, cryET63. cryET64. cryET66, cryET67, cryET68, cryET72, 
cryET73, and cryET83 transgenes, either native, synthetically modified, or mutated. In some 
5 instances, more than one transgene will be incorporated into one or more genomes of the 
transformed host plant cell. Such is the case when more than one crystal protein-encoding DNA 
segment is incorporated into the genome of such a plant. In certain situations, it may be 
desirable to have one, two, three, four, or even more B. thuringiensis crystal proteins (either 
^ native or recombinantly-engineered) incorporated and stably expressed in the transformed 
10 transgenic plant. 

A preferred gene which may be introduced includes, for example, a crystal protein- 
encoding DNA sequence from bacterial origin, and particularly one or more of those described 
herein which are obtained from Bacillus spp. Highly preferred nucleic acid sequences are those 
obtained from B. thuringiensis, or any of those sequences which have been genetically 
15 engineered to decrease or increase the insecticidal activity of the crystal protein in such a 
transformed host cell. 

Means for transforming a plant cell and the preparation of a transgenic cell line are well- 
known in the art, and are discussed herein. Vectors, plasmids, cosmids, YACs (yeast artificial 
chromosomes) and DNA segments for use in transforming such cells will, of course, generally 
- 20 comprise either the operons, genes, or gene-derived sequences of the present invention, either 
native, or synthetically-derived, and particularly those encoding the disclosed crystal proteins. 
These DNA constructs can further include structures such as promoters, enhancers, polylinkers, 
or even gene sequences which have positively- or negatively-regulating activity upon the 
particular genes of interest as desired. The DNA segment or gene may encode either a native or 
25 modified crystal protein, which will be expressed in the resultant recombinant cells, and/or 
which will impart an improved phenotype to the regenerated plant. 

Such transgenic plants may be desirable for increasing the insecticidal resistance of a 
monocotyledonous or dicotyledonous plant, by incorporating into such a plant, a transgenic 
DNA segment encoding one or more CryET31, CryET40, CryET43, CryET44, CiyET45, 
30 CryET46, CryET47, CryET49, CryETSl, CryET52, CryET53, CryET54, CryETSS. CryET56, 
CryET57, CryET59, CryET60, CryET61, CryET62, CryET63, CryET64, CryET66, CryET67, 
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CryET68, CryET72, CryET73, and CryET83 polypeptides which are toxic to a lepidopteran 
insect. Particularly preferred plants include turf grasses, kapok, sorghum, cotton, com, soybeans, 
oats, rye, wheat, flax, tobacco, rice, tomatoes, potatoes, or other vegetables, ornamental plants, 
fruit trees, and the like. 

5 In a related aspect, the present invention also encompasses a seed produced by the 

transformed plant, a progeny from such seed, and a seed produced by the progeny of the original 
transgenic plant, produced in accordance with the above process. Such progeny and seeds will 
have a crystal protein-encoding transgene stably incorporated into their genome, and such 
progeny plants will inherit the traits afforded by the introduction of a stable transgene in 

10 Mendelian fashion. All such transgenic plants having incorporated into their genome transgenic 
DNA segments encoding one or more CryETSl, CryET40, CiyET43, CryET44, CryET45, 
CryET46, CryET47, CryET49, CryETSl, CryET52, CryETSB, CryET54, CryET55. CryET56, 
CryET57, CryET59, CryET60, CryET61, CryET62. CryET63, CryET64, CiyET66, CiyET67, 
CryET68, CiyET72, CryET73, and CryET83 crystal proteins or polypeptides are aspects of this 

15 invention. As well-known to those of skill in the art, a progeny of a plant is understood to mean 
any offspring or any descendant from such a plant, but in this case means any offspring or any 
descendant which also contains the transgene. 

2.5 Site-Specific Mutagenesis 

Site-specific mutagenesis is a technique useful in the preparation of individual peptides, 
20 or biologically ftinctional equivalent proteins or peptides, through specific mutagenesis of the 
underlying DNA. The technique further provides a ready ability to prepare and test sequence 
variants, for example, incorporating one or more of the foregoing considerations, by introducing 
one or more nucleotide sequence changes into the DNA. The technique of site-specific 
mutagenesis is well known in the art, as exemplified by various publications. 
25 In general, site-directed mutagenesis in accordance herewith is performed by first 

obtaining a single-stranded vector or melting apart of two strands of a double stranded vector 
which includes within its sequence a DNA sequence which encodes the desired peptide. An 
oligonucleotide primer bearing the desired mutated sequence is prepared, generally synthetically. 
This primer is then aimealed writh the single-stranded vector, and subjected to DNA polymerizing 
30 enzymes such as E. coli polymerase I Klenow fragment, in order to complete the synthesis of the 
mutation-bearing strand. Thus, a heteroduplex is formed wherein one strand encodes the original 
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t 

non-mutated sequence and the second strand bears the desired mutation. This heteroduplex 
vector is then used to transform appropriate cells, such as E. coli cells, and clones are selected 
which include recombinant vectors bearing the mutated sequence arrangement. 

The preparation of sequence variants of the endotoxin-encoding nucleic acid segments 
5 using site-directed mutagenesis is provided as a means of producing potentially useful species 
and is not meant to be limiting as there are other ways in which sequence variants of peptides 
and the DNA sequences encoding them may be obtained. For example, recombinant vectors 
encoding the desired peptide sequence may be treated with mutagenic agents, such as 
\ hydroxylamine, to obtain sequence variants. 

10 2.6 Antibody Compositions and Methods of Making 

In particular embodiments, the inventors contemplate the use of antibodies, either 
monoclonal (mAbs) or polyclonal which bind to one or more of the polypeptides disclosed 
herein. Means for preparing and characterizing antibodies are well known in the art (See, e.g., 
Harlow and Lane, 1988; incorporated herein by reference). mAbs may be readily prepared 
15 through use of well-known techniques, such as those exemplified in U. S. Patent 4,196,265, 
incorporated herein by reference. 

2.7 ELISAs and Immunoprecipitation 

ELlSAs may be used in conjunction with the invention. Many different protocols exist 
^ for performing ELISAs. These are well known to those of ordinary skill in the art. Examples of 
20 basic ELISA protocols may be found in any standard molecular biology laboratory manual (e.g. 
Sambrook, Fritsch, and Maniatis, eds. Molecular cloning: a laboratory manual. Cold Spring 
Harbor, NY: Cold Spring Harbor Laboratory, 1989). 

2.8 Western Blots 

The compositions of the present invention will find great use in immunoblot or western 
25 blot analysis. Methods of performing immunoblot and western blot analysis are well known to 
those of skill in the are (see Sambrook, et al, ibid). Immunologically-based detection methods 
for use in conjunction with Western blotting include enzymatically-, radiolabel-, or 
fluorescently-tagged secondary antibodies against the toxin moiety are considered to be of 
particular use in this regard. 
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2.9 Crystal Protein Screening and Detection Kits 

The present invention contemplates methods and kits for screening samples suspected of 
containing crystal protein polypeptides or crystal protein-related polypeptides, or cells producing 
such polypeptides, A kit may contain one or more antibodies of the present invention, and may 
5 also contain reagent(s) for detecting an interaction betv^een a sample and an antibody of the 
present invention. The provided reagent(s) can be radio-, fiuorescently- or enzymatically-labeled 
or even epitope or ligand tagged. The kit can contain a known radiolabeled agent capable of 
binding or interacting with a nucleic acid or antibody of the present invention. 

The reagent(s) of the kit can be provided as a liquid solution, attached to a solid support 

10 or as a dried powder. Preferably, when the reagent(s) are provided in a liquid solution, the liquid 
solution is an aqueous solution. Preferably, when the reagent(s) provided are attached to a solid 
support, the solid support can be chromatograph media, a test plate having a plurality of wells, or 
a microscope slide. When the reagent(s) provided are a dry powder, the powder can be 
reconstituted by the addition of a suitable solvent, that may be provided. 

15 In still further embodiments, the present invention concerns inununodetection methods 

and associated kits. It is proposed that the crystal proteins or peptides of the present invention 
may be employed to detect antibodies having reactivity therewith, or, alternatively, antibodies 
prepared in accordance with the present invention, may be employed to detect crystal proteins or 
crystal protein-related epitope-containing peptides. In general, these methods will include first 

20 obtaining a sample suspected of containing such a protein, peptide or antibody, contacting the 
sample with an antibody or peptide in accordance with the present invention, as the case may be, 
under conditions effective to allow the formation of an immunocomplex, and then detecting the 
presence of the immunocomplex. 

In general, the detection of immunocomplex formation is quite well known in the art and 

25 may be achieved through the application of numerous approaches. For example, the present 
invention contemplates the application of ELISA, RIA, immunoblot (e.g., dot blot), indirect 
immunofluorescence techniques and the like. One may find additional advantages through the 
use of a secondary binding ligand such as a second antibody or a biotin/avidin ligand binding 
arrangement, as is known in the art. 

30 For assaying purposes, it is proposed that virtually any sample suspected of comprising 

either a crystal protein or peptide or a crystal protein-related peptide or antibody sought to be 
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detected, as the case may be, may be employed. It is contemplated that such embodiments may 
have application in the titering of antigen or antibody samples, in the selection of hybridomas, 
and the like. In related embodiments, the present invention contemplates the preparation of kits 
that may be employed to detect the presence of crystal proteins or related peptides and/or 
antibodies in a sample. Samples may include cells, cell supematants, cell suspensions, cell 
extracts, enzyme fractions, protein extracts, or other cell-free compositions suspected of 
containing crystal proteins or peptides. 

Generally speaking, kits in accordance with the present invention will include a suitable 
crystal protein, peptide or an antibody directed against such a protein or peptide, together with an 
immunodetection reagent and a means for containing the antibody or antigen and reagent. The 
immunodetection reagent will typically comprise a label associated with the antibody or antigen, 
or associated with a secondary binding ligand. Exemplary ligands might include a secondary 
antibody directed against the first antibody or antigen or a biotin or avidin (or streptavidin) 
ligand having an associated label. Of course, as noted above, a number of exemplary labels are 
known in the art and all such labels may be employed in connection with the present invention. 

The container will generally include a vial into which the antibody, antigen or detection 
reagent may be placed, and preferably suitably aliquotted. The kits of the present invention will 
also typically include a means for containing the antibody, antigen, and reagent containers in 
close confinement for commercial sale. Such containers may include injection or blow-molded 
plastic containers into which the desired vials are retained. 
2.10 Epitopic Core Sequences 

The present invention is also directed to protein or peptide compositions, free from total 
cells and other peptides, which comprise a purified protein or peptide which incorporates an 
epitope that is immunologically cross-reactive with one or more anti-crystal protein antibodies. 
In particular, the invention concerns epitopic core sequences derived from Cry proteins or 
peptides. 

As used herein, the term "incorporating an epitope(s) that is immunologically cross- 
reactive with one or more anti-crystal protein antibodies" is intended to refer to a peptide or 
protein antigen which includes a primary, secondary or tertiary structure similar to an epitope 
located within a crystal protein or polypeptide. The level of similarity will generally be to such a 
degree that monoclonal or polyclonal antibodies directed against the crystal protein or 
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polypeptide will also bind to, react with, or otherwise recognize, the cross-reactive peptide or 
protein antigen. Various immunoassay methods may be employed in conjunction with such 
antibodies, such as, for example. Western blotting, ELISA, RIA, and the like, all of which are 
known to those of skill in the art. The identification of Cry immunodominant epitopes, and/or 

5 their functional equivalents, suitable for use in vaccines is a relatively straightforward matter 
(e.g. U. S. Patent 4,554,101: Jameson and Wolf, 1988; Wolf et al, 1988; U. S- Patent 
4,554,101). The amino acid sequence of these "epitopic core sequences" may then be readily 
incorporated into peptides, either through the application of peptide synthesis or recombinant 
technology. ) 

10 Preferred peptides for use in accordance with the present invention will generally be on 

the order of about 8 to about 20 amino acids in length, and more preferably about 8 to about 15 
amino acids in length. It is proposed that particular advantages of the present invention may be 
realized through the preparation of synthetic peptides which include modified and/or extended 
epitopic/immimogenic core sequences which result in a "universal" epitopic peptide directed to 



15 crystal proteins, and in particular CryET31, GryET40, CryET43, CryET44, CryET45, CryET46, 
CryET47, CryET49, CryETSl, CryET52, CryET53, CryET54, CryET55, CryET56, CryET57, 
CryET59, CryET60, CryET61, CryET62. CiyET63, CryET64, CryET66, CryET67, CryET68, 
CryET72, CryET73, CryET83 and related sequences. These epitopic core sequences are 
identified herein in particular aspects as hydrophilic regions of the particular polypeptide 



Computerized peptide sequence analysis programs (e.^., DNAStai<g> soihvare, DNAStar, 
. Inc., Madison, WI) may also be useful in designing synthetic peptides in accordance with the 
present disclosure. 

Syntheses of epitopic sequences, or peptides which include an antigenic epitope within 
2s their sequence, are readily achieved using conventional synthetic techniques such as the solid 
phase method {e.g., through the use of commercially available peptide synthesizer such as an 
Applied Biosystems Model 430A Peptide Synthesizer). 

2.11 Biological Functional Equivalents 



30 invention and DNA segments which encode them and still obtain a functional molecule that 
encodes a protein or peptide with desirable characteristics. The following is a discussion based 



20 antigen. 




Modification and changes may be made in the structure of the peptides of the present 
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upon changing the amino acids of a protein to create an equivalent, or even an improved, second- 
generation molecule. In particular embodiments of the invention, mutated crystal proteins are 
contemplated to be useful for increasing the insecticidal activity of the protein, and consequently 
increasing the insecticidal activity and/or expression of the recombinant transgene in a plant cell. 
5 The amino acid changes may be achieved by changing the codons of the DNA sequence, 
according to the codons given in Table I. 
Table 1 



Amino Acids Codons 



Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


UGC 


UGU 










Aspartic acid 


Asp 


D 


GAC 


GAU 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


UUC 


UUU 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 






Histidine 


His 


H 


CAC 


CAU 










Isoleucine 


lie 


I 


AUA 


AUG 


AUU 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


CUG 


CUU 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 




ecu 






Proline 


Pro 


P 


CCA 


CCC 


CCG 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


GGG 


CGU 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


UCC 


UCG 


UCU 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Tip 


w 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











For example, certain amino acids may be substituted for other amino acids in a protein 
10 structure without appreciable loss of interactive binding capacity with structures such as, for 
example, antigen-binding regions of antibodies or binding sites on substrate molecules. Since it 
is the interactive capacity and nature of a protein that defines that protein's biological functional 
activity, certain amino acid sequence substitutions can be made in a protein sequence, and, of 
course, its underlying DNA coding sequence, and nevertheless obtain a protein with like 
15 properties. It is thus contemplated by the inventors that various changes may be made in the 
peptide sequences of the disclosed compositions, or corresponding DNA sequences which 
encode said peptides without appreciable loss of their biological utility or activity. 
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In making such changes, the hydropathic index of amino acids may be considered. The 
importance of the hydropathic amino acid index in conferring interactive biologic function on a 
protein is generally understood in the art (Kyte and Doolittle, 1982, incorporate herein by 
reference). It is accepted that the relative hydropathic chiaracter of the amino acid contributes to 
5 the secondary structure of the resultant protein, which in turn defines the interaction of the 
protein with other molecules, for example, enzymes, substrates, receptors, DNA, antibodies, 
antigens, and the like. 

Each amino acid has been assigned a hydropathic index on the basis of their 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982), these are: isoleucine 

10 (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cystejne/cystine (+2.5); methionine 
(+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine 
(-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); 
asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

It is known in the art that certain amino acids may be substituted by other amino acids 

15 having a similar hydropathic index or score and still result in a protein with similar biological 
activity, i.e., still obtain a biological functionally equivalent protein. In making such changes, 
the substitution of amino acids whose hydropathic indices are within ±2 is preferred, those which 
are within ±1 are particularly preferred, and those within +0.5 are even more particularly 
preferred. 

20 It is also understood in the art that the substitution of like amino acids can be made ^ 

effectively on the basis of hydrophilicity. U. S. Patent 4,554,101, incorporated herein by 
reference, states that the greatest local average hydrophilicity of a protein, as governed by the 
hydrophilicity of its adjacent amino acids, correlates with a biological property of the protein. 

As detailed in U. S. Patent 4,554,101, the following hydrophilicity values have been 

25 assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); glutamate 
(+3.0 ± 1); serine (+0.3); asparagine (+0,2); glutamine (+0.2); glycine (0); threonine (-0.4); 
proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (- 
1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4). 

It is understood that an amino acid can be substituted for another having a similar 

30 hydrophilicity value and still obtain a biologically equivalent, and in particular, an 
immunologically equivalent protein. In such changes, the substitution of amino acids whose 
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hydrophilicity values are within ±2 is preferred, those which are within ±1 are particularly 
preferred, and those within ±0.5 are even more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based on the relative 
similarity of the amino acid side-chain substituents, for example, their hydrophobicity, 
5 hydrophilicity, charge, size, and the like. Exemplary substitutions which take various , of the 
foregoing characteristics into consideration are well known to those of skill in the art and 
include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine and 
asparagine; and valine, leucine and isoleucine. 
2.12 Insecticidal Compositions and Methods of Use 

10 The inventors contemplate that the crystal protein compositions disclosed herein will find 

particular utility as insecticides for topical and/or systemic application to field crops, grasses, 
fruits and vegetables, and ornamental plants. In a preferred embodiment, the bioinsecticide 
composition comprises an oil flowable suspension of bacterial cells which expresses a novel 
' crystal protein disclosed herein. Preferably the cells are B. thuringiensis NRRL B-2 1 92 1 , NRRL 

15 B-21922, NRRL B-21923, NRRL B-21924, NRRL B-21925, NRRL B-21926, NRRL B-21927, 
NRRL B.21928, NRRL B-21929, NRRL B-21930, NRRL B-21931, NRRL B-21932, NRRL 
B-21933, NRRL B-21934, NRRL B-21935, NRRL B-21936, NRRL B-21937, NRRL B-21938, 
NRRL B-21939,NRRL B-21940, NRRL B-21941, NRRL B-21942, NRRL B-21943, and NRRL 
B-2 1944, however, any such bacterial host cell expressing the novel nucleic acid segments 

20 disclosed herein and producing a crystal protein is contemplated to be useful, such as 
B. thuringiensis, B. megaterium, B. subtilis, E. coli, or Pseudomonas spp. 

In another important embodiment, the bioinsecticide composition comprises a water 
dispersible granule. This granule comprises bacterial cells which expresses a novel crystal 
protein disclosed herein. Preferred bacterial cells are B. thuringiensis NRRL B-2 1921, NRRL 

25 B-21922, NRRL B-21923, NRRL B-21924, NRRL B-21925, NRRL B-21926, NRRL B-21927. 
NRRL B-21928, NRRL B-21929, NRRL B-21930, NRRL B-21931, NRRL B.21932, NRRL 
B-21933, NRRL B-21934, NRRL B-21935, NRRL B-21936, NRRL B-21937, NRRL B-21938, 
NRRL B-2 193 9, NRRL B-21940, NRRL B-21941, NRRL B-21942, NRRL B-21943, and NRRL 
B-2 1944, however, bacteria such as B. thuringiensis, B. megaterium, B. subtilis, E. coli, or 
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10 



IS 



20 



25 



Pseudomonas spp. cells transformed with a DNA segment disclosed herein and expressing the 
crystal protein are also contemplated to be useful. 

In a third important embodiment, the bioinsecticide composition comprises a wettable 
powder, dust, pellet, or collodial concentrate. This powder comprises bacterial cells which 
expresses a novel crystal protein disclosed herein. Preferred bacterial cells are B. thuringiensis 
NRRL B-21921, NRRL B-2I922, NRRL B-21923, NRRL B-21924, NRRL B-21925, NRRL 
B-21926, NRRL B-21927, NRRL B-21928, NRRL B-21929, NRRL B-21930, NRRL B-21931, 
NRRL B-21932, NRRL B-21933, NRRL B-21934, NRRL B-21935, NRRL B-21936, NRRL 
B-21937, NRRL B-21938, NRRL B-21939, NRRL B-21940, NRRL B-21941, NRRL B-21942, ) 
NRRL B-21943, and NRRL B-21944 cells, however, bacteria such as B. thuringiensis^ B. 
megaterium, B. subtilis, E. coli, or Pseudomonas spp. cells transformed with a DNA segment 
disclosed herein and expressing the crystal protein are also contemplated to be useful. Such dry 
forms of the insecticidal compositions may be formulated to dissolve immediately upon wetting, 
or alternatively, dissolve in a controlled-release, sustained-release, or other time-dependent 
manner. 

In a fourth important embodiment, the bioinsecticide composition comprises an aqueous 
suspension of bacterial cells such as those described above which express the crystal protein. 
Such aqueous suspensions may be provided as a concentrated stock solution which is diluted 
prior to application, or alternatively, as a diluted solution ready-to-apply. 

For these methods involving application of bacterial cells, the cellular host containing the 



crystal protein gene(s) may be grown in any convenient nutrient medium, where the DNA 
construct provides a selective advantage, providing for a selective medium so that substantially 
all or all of the cells retain the B. thuringiensis gene. These cells may then be harvested in 
accordance with conventional ways. Alternatively, the cells can be treated prior to harvesting. 

When the insecticidal compositions comprise intact B. thuringiensis cells expressing the 
protein of interest, such bacteria may be formulated in a variety of ways. They may be employed 
as wettable powders, granules or dusts, by mixing with various diluents, inert materials, such as 
inorganic minerals (phyllosilicates, carbonates, sulfates, phosphates, and the like) or botanical 
materials (powdered corncobs, rice hulls, walnut shells, and the like).j The formulations may 
include spreader-sticker adjuvants, stabilizing agents, other pesticidal additives, or surfactants. 
Liquid formulations may be aqueous-based or non-aqueous and employed as foams, suspensions. 
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emulsifiable concentrates, or the like. The ingredients may include rheological agents, 
surfactants, emulsifiers, dispersants, or polymers. 

Alternatively, the novel insecticidal polypeptides may be prepared by native or 
recombinant bacterial expression systems in vitro and isolated for subsequent field application. 
Such protein may be either in crude cell lysates, suspensions, colloids, etc., or alternatively may 
be purified, refined, buffered, and/or ftirther processed, before formulating in an active biocidal 
formulation. Likewise, under certain circumstances, it may be desirable to isolate crystals and/or 
spores from bacterial cultures expressing the crystal protein and apply solutions, suspensions, or 
collodial preparations of such crystals and/or spores as the active bioinsecticidal composition. 

Regardless of the method of application, the amount of the active component(s) is applied 
at an insecticidally-effective amount, which will vary depending on such factors as, for example, 
the specific coleopteian insects to be controlled, the specific plant or crop to be treated, the 
environmental conditions, and the method, rate, and quantity of application of the insecticidally- 
active composition. 

The insecticide compositions described may be made by formulating either the bacterial 
cell, crystal and/or spore suspension, or isolated protein component with the desired 
agriculturally-acceptable carrier. The compositions may be formulated prior to administration in 
an appropriate means such as lyophilized, freeze-dried, dessicated, or in an aqueous carrier, 
medium or suitable diluent, such as saline or other buffer. The formulated compositions may be 
in the form of a dust or granular material, or a suspension in oil (vegetable or mineral), or water 
or oil/water emulsions, or as a wettable powder, or in combination with any other carrier material 
suitable for agricultural application. Suitable agricultural carriers can be solid or liquid and are 
well known in the art. The term "agriculturally-acceptable carrier" covers all adjuvants, £1 coli, 
inert components, dispersants, surfactants, tackifiers, binders, etc. that are ordinarily used in 
insecticide formulation technology; these are well known to those skilled in insecticide 
formulation. The formulations may be mixed with one or more solid or liquid adjuvants and 
prepared by various means, E. coli, by homogeneously mixing, blending and/or grinding the 
insecticidal composition with suitable adjuvants using conventional formulation techniques. 

The insecticidal compositions of this invention are applied to the environment of the 
target lepidopteran insect, typically onto the foliage of the plant or crop to be protected, by 
conventional methods, preferably by spraying. The strength and duration of insecticidal 
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application will be set with regard to conditions specific to the particular pest(s), crop(s) to be 
treated and particular environmental conditions. The proportional ratio of active ingredient to 
carrier will naturally depend on the chemical nature, solubility, and stability of the insecticidal 
composition, as well as the particular formulation contemplated. 

Other application techniques, including dusting, sprinkling, soaking, soil injection, seed 
coating, seedling coating, spraying, aerating, misting, atomizing, and the like, are also feasible 
and may be required under certain .circumstances such as e.g.. insects that cause root or stalk 
infestation, or for application to delicate vegetation or ornamental plants. These application 
procedures are also well-known to those of skill in the art. ' 

The insecticidal composition of the invention may be employed in the method of the 
invention singly or in combination with other compounds, including and not limited to other 
pesticides. The method of the invention may also be used in conjunction with other treatments 
such as surfactants, detergents, polymers or time-release : formulations. The insecticidal 
compositions of the present invention may be formulated for either systemic or topical use. 

The concentration of insecticidal composition which is used for environmental, systemic, 
or foliar application will vary widely depending upon the nature of the particular formulation, 
means of application, environmental conditions, and degree of biocidal activity. Typically, the 
bioinsecticidal composition will be present in the applied formulation at a concentration of at 
least about 1% by weight and may be up to and including about 99% by weight. Dry 
formulations of the polypeptide compositions may be from about 1% to about 99% or more by 
weight of the protein composition, while liquid formulations may generally comprise from about 
1% to about 99% or more of the active ingredient by weight. Formulations which comprise 
intact bacterial cells will generally contain from about 10'' to about lO' cells/mg. 

The insecticidal formulation may be administered to a particular plant or target areia in 
one or more applications as needed, with a typical field application rate per hectare ranging on 
the order of from about 50 g to about 500 g of active ingredient, or of from about 500 g to about 
1000 g, or of from about 1 000 g to about 5000 g or more of active ingredient. 
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5.0 Description of Illustrative Embodiments 

5.1 Some Advantages of the Invention 

The use of B. thuringiensis insecticidal crystal protein genes for in planta production of 
insecticidal proteins, thereby conferring insect resistance on important agronomic plants, is 

5 rapidly gaining commercial acceptance in the United States and abroad. The need for new 
insecticidal traits does not diminish, however, with the successful deployment of a handful of cry 
genes in plants. Concerns over the potential for insect resistance development, for instance, 
makes it imperative that an arsenal of insecticidal proteins (i.e. cry genes) be assembled to 
provide the genetic material necessary for tomorrow's insecticidal 'traits. In addition, transgenic 

10 plants producing a B. thuringiensis Cry protein may still be susceptible to damage from 
secondary insect pests, thus prompting the search for additional Ciy proteins with improved 
efficacy towards these pests. The B. thuringiensis crystal proteins of the present invention 
represent a diverse collection of insecticidal proteins, including several that are toxic towards a 
lepidopteran colony exhibiting resistance to certain types of Cryl proteins. Bioassays against a 

IS wide range of lepidopteran pests confirm that these proteins possess insecticidal activity and, 
furthermore, that these proteins vary in efficacy against this array of target insects. This 
variation in the spectrum of insects affected by the toxin proteins suggests differing modes of 
action that may be important for future insect resistance management strategies. In planta 
expression of the cry genes of the present invention can confer insect resistance to the host plant 
20 as has been demonstrated for other cry genes from B. thuringiensis. 

5.2 Probes and Primers 

In another aspect, DNA sequence information provided by the invention allows for the 
preparation of relatively short DNA (or RNA) sequences having the ability to specifically 
hybridize to gene sequences of the selected polynucleotides disclosed herein. In these aspects, 

25 nucleic acid probes of an appropriate length are prepared based on a consideration of a selected 
crystal protein gene sequence, e.g., a sequence such as that shown in SEQ ID NO:l, SEQ ID 
NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID N0:9, SEQ ID N0:1 1, SEQ ID NO: 13, SEQ ID 
NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID N0:21, SEQ ID NO:23, SEQ ID NO:25, SEQ 
ID NO:27, SEQ ID NO:29, SEQ ID N0:31, SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, 

30 SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID 



BNSDOCIO; <WO_0119859A2_L> 



wo 01/19859 



PCT/USOO/25361 



-29- 

r 

NO:49 and SEQ ID NO:62. The ability of such DNAs and nucleic acid probes to specifically 
hybridize to a crystal protein-encoding gene sequence lends them particular utility in a variety of 
embodiments. Most importantly, the probes may be used in a variety of assays for detecting the 
presence of complementary sequences in a given sample. 

5 In certain embodiments, it is advantageous to use oligonucleotide primers. The sequence 

of such primers is designed using a polynucleotide of the present invention for use in detecting, 
amplifying or mutating a defined segment of a crystal protein gene from B. thuhngiensis using 
PGR™ technology. Segments of related crystal protein genes from other species may also be 
amplified by PGR™ using such primers. } 

10 To provide certain of the advantages in accordance with the present invention, a preferred 

nucleic acid sequence employed .for hybridization studies or assays includes sequences that are 
complementary to at least a 14 to 30 or so long nucleotide stretch of a crystal protein-encoding 
sequence, such as that shown in SEQ ID NO:l, SEQ ID NO:3, SEQ ID N0:5, SEQ ID N0:7, 
SEQ ID N0:9, SEQ ID NO:ll, SEQ ID N0:13, SEQ ID NO:15, SEQ ID N0:17, SEQ ID 

15 NO:19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ 
ID NO:31, SEQ ID NO:33. SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:41. 
SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49 and SEQ ID NO:62. A size of 
at least about 14 or so nucleotides in length helps to ensure that the fragment will be of sufficient 
length to form a duplex molecule that is both stable and selective. Molecules having 

20 complementary sequences over stretches greater than about 14 or so bases in length are generally ^ 
preferred, though, in order to increase stability and selectivity of the hybrid, and thereby improve 
the quality and degree of specific hybrid molecules obtained. One will generally prefer to design 
nucleic acid molecules having gene-complementaiy stretches of about 14 to about 20 or so 
nucleotides, or even longer where desired. Such fragments may be readily prepared by, for 

25 example, directly synthesizing the fragment by chemical means, by application of nucleic acid 
reproduction technology, such as the PGR™ technology of U. S. Patents 4,683,195, and 
4,683,202, herein incorporated by reference, or by excising selected DNA fragments from 
recombinant plasmids containing appropriate inserts and suitable restriction sites, 

5.3 Expression Vectors 

30 The present invention contemplates an expression vector comprising a polynucleotide of 

the present invention. Thus, in one embodiment an expression vector is an isolated and purified 
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DNA molecule comprising a promoter operatively linked to an coding region that encodes a 
polypeptide of the present invention, which coding region is operatively linked to a transcription- 
terminating region, whereby the promoter drives the transcription of the coding region. 

As used herein, the term "operatively linked" means that a promoter is connected to an 
coding region in such a way that the transcription of that coding region is controlled and 
regulated by that promoter. Means for operatively linking a promoter to a coding region are well 
known in the art. 

In a preferred embodiment, the recombinant expression of DNAs encoding the crystal 
proteins of the present invention is preferable in a Bacillus host cell. Preferred host cells include 
B. thuringiensis, B. megaterium, B. subiilis, and related bacilli, with B. Ihuringiensis host cells 
being highly preferred. Promoters that function in bacteria are well-known in the art. An 
exemplary and preferred promoter for the Bacillus crystal proteins include any of the known 
crystal protein gene promoters, including the cryETSI, cryET40, cryET43, cryET44, cryET45, 
cryET46, cryET47. cryET49. cryETSl. cryET52. cryET53. cryET54, cryET55, cryET56. 
cryET57, cryET59. cryET60, cryET61, cryET62, cryET63. cryET64. cryET66, cryET67, 
cryET68, cryET72, cryET73, and cryET83 gene promoters. Alternatively, mutagenized or 
recombinant crystal protein-encoding gene promoters may be engineered by the hand of man and 
used to promote expression of the novel gene segments disclosed herein. 

In an altemate embodiment, the recombinant expression of DNAs encoding the crystal 
proteins of the present invention is performed using a transformed Gram-negative bacterium 
such as an £. colt or Pseudomonas spp. host cell. Promoters which function in high-level 
expression of target polypeptides in E. coli and other Gram-negative host cells are also well- 
known in the art. 

Where an expression vector of the present invention is to be used to transform a plant, a 
promoter is selected that has the ability to drive expression in plants. Promoters that function in 
plants are also well known in the art. Useful in expressing the polypeptide in plants are 
promoters that are inducible, viral, synthetic, constitutive as described (Poszkowski et al, 1989; 
Odell et al, 1985), and temporally regulated, spatially regulated, and spatio-temporally regulated 
(Chaue/fl/., 1989). 

A promoter is also selected for its ability to direct the transformed plant cell's or 
transgenic plant's transcriptional activity to the coding region. Structural genes can be driven by 
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a variety of promoters in plant tissues. Promoters can be near-constitutive, such as the CaMV 
35S promoter, or tissue-specific or developmentally specific promoters affecting dicots or 
monocots. 

Where the promoter is a near-constitutive promoter such as CaMV 35S, increases in 
5 polypeptide expression are found in a variety of transformed plant tissues {e.g., callus, leaf, seed 
and root). Alternatively, the effects of transformation can be directed to specific plant tissues by 
using plant integrating vectors containing a tissue-specific promoter. 

An exemplary tissue-specific promoter is the lectin promoter, which is specific for seed 
tissue. The Lectin protein in soybean seeds is encoded by a single gene {LeJ) that is only 
10 expressed during seed maturation and accounts for about 2 to about 5% of total seed mRNA. 
The lectin gene and seed-specific promoter have been fully characterized and used to direct seed 
specific expression in transgenic tobacco plants (Vodkin e/ al., 1983; Lindstrom et al, 1990.) 

An expression vector containing a coding region that encodes a polypeptide of interest is 
engineered to be under control of the lectin promoter and that vector is introduced into plants 
15 using, for example, a protoplast transformation method (Dhir et al, 1991). The expression of the 
polypeptide is directed specifically to the seeds of the transgenic plant. 

A transgenic plant of the present invention produced from a plant cell transformed with a 
tissue specific promoter can be crossed with a second transgenic plant developed fi'om a plant 
cell transformed with a different tissue specific promoter to produce a hybrid transgenic plant 
20 that shows the effects of transformation in more than one specific tissue. 

Exemplary tissue-specific promoters are com sucrose synthetase 1 (Yang et al., 1990), 
com alcohol dehydrogenase 1 (Vogel et al, 1989), com light harvesting complex (Simpson, 
1986), com heat shock protein (Odell et al, 1985), pea small subunit RuBP carboxylase 
(Poulsen et al, 1986; Cashmore et al, 1983), Ti plasmid mannopine synthase (Langridge et al, 
25 1989), Ti plasmid nopaline synthase (Langridge et al, 1989), petunia chalcone isomerase (Van 
Tunen et al, 1988), bean glycine rich protein 1 (Keller et al, 1989), CaMV 35s transcript (Odell 
et al, 1985) and Potato patatin (Wenzler et al, 1989), Preferred promoters include a cauliflower 
mosaic virus (CaMV 35S) promoter, a S-E9 small subunit RuBP carboxylase promoter, a rice 
actin promoter, a maize histone promoter, a fused CaMV 35S-Arabidopsis histone promoter, a 
30 CaMV 35 S promoter, a CaMV 19S promoter, a nos promoter, an Adh promoter, an actin 
promoter, a histone promoter, a ribulose bisphosphate carboxylase promoter, an R-allele 
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promoter, a root cell promoter, an a-tubulin promoter, an ABA-inducible promoter, a turgor- 
inducible promoter, a rbcS promoter, a com sucrose synthetase 1 promoter, a com alcohol 
dehydrogenase 1 promoter, a com light harvesting complex promoter, a com heat shock protein 
promoter, a pea small subunit RuBP carboxylase promoter, a Ti plasmid maimopine synthase 
5 promoter, a Ti plasmid nopaline synthase promoter, a petunia chalcone isomerase promoter, a 
bean glycine rich protein 1 promoter, a CaNfV 35s transcript promoter, a potato patatin promoter, 
a cab promoter, a PEP-Carboxylase promoter and an S-E9 small subunit RuBP carboxylase 
promoter. 

The choice of which expression vector and ultimately to which promoter a polypeptide 

10 coding region is operatively linked depends directly on the functional properties desired, e.g., the 
location and timing of protein expression, and the host cell to be transformed. These are well 
known limitations inherent in the art of constructing recombinant DNA molecules. However, a 
vector useful in practicing the present invention is. capable of directing the expression of the 
polypeptide coding region to which it is operatively linked. 

15 Typical vectors useful for expression of genes in higher plants are well known in the art 

and include vectors derived from the tumor-inducing (Ti) plasmid of Agrobacterium tumefaciens 
described (Rogers et al., 1987). However, several other plant integrating vector systems are 
knovwi to function in plants including pCaMVCN transfer control vector described (Fromm et 
al., 1985). Plasmid pCaMVCN (available from Pharmacia, Piscataway, NJ) includes the 

20 cauliflower mosaic virus CaMV 35S promoter. 

In preferred embodiments, the vector used to express the polypeptide includes a selection 
marker that is effective in a plant cell, preferably a drug resistance selection marker. One 
preferred drug resistance marker is the gene whose expression results in kanamycin resistance; 
i.e., the chimeric gene containing the nopaline synthase promoter, Tn5 neomycin 

25 phosphotransferase II (nptJl) and nopaline synthase 3' non-translated region described (Rogers et 
al., 1988). 

RNA polymerase transcribes a coding DNA sequence through a site where 
polyadenylation occurs. Typically, DNA sequences located a few himdred base pairs 
downstream of the polyadenylation site serve to terminate transcription. Those DNA sequences 
30 are referred to herein as transcription-termination regions. Those regions are required for 
efficient polyadenylation of transcribed messenger RNA (mRNA). 
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I I 

Means for preparing expression vectors are well known in the art. Expression 
(transformation vectors) used to transform plants and methods of making those vectors are 
described in U. S. Patents 4,971,908, 4,940,835, 4,769,061 and 4,757,011, the disclosures of 
which are incorporated herein by reference. Those vectors can be modified to include a coding 
sequence in accordance with the present invention. 

A variety of methods has been developed to operatively link DNA to vectors via 
complementary cohesive termini or blunt ends. For instance, complementary homopolymer 
tracts can be added to the DNA segment to be inserted and to the vector DNA. The vector and 
DNA segment are then joined by hydrogen bonding between the complementary homopolymeric 
tails to form recombinant DNA molecules. , 

A coding region that encodes a polypeptide having the ability to confer insecticidal 
activity to a cell is preferably a CryET31, CryET40, CryET43, CryET44, CTyET45, CryET46, 
CryET47, CryET49, CryETSl, CryET52, CryET53, CryET54, CryETSS, CryET56, CryET57, 
CryET59, CryET60, CryET61, CryET62, CryET63, CryET64, CryET66, CryET67, CryET68, 
CryET72, CryET73, and CryET83 polypeptide-encbding gene. 

5.7 Nomenclature of the Novel Polypeptides 

The inventors have arbitrarily assigned the designation CryET31, CryET40, CryET43, 
CryET44, CryET45, CryET46, CTyET47, CryET49, CryETSl, CryET52, CryET53, CryET54, 
CryET56, CryET57, CryET59, CryET60, CryET61. CryET62, CryET63, CryET64, CryET66, 
CryET67, CryET68, CryET72, CiyET73, and CryET83 to the polypeptides of this invention. 
Likewise, the arbitrary designations of cryETSl. cryET40. cryET43. cryET44. cryET45. 
cryET46. cryET47, cryET49, cryETSl, cryET52. cryET53. cryET54, cryET56, cryET57, 
cryET59. cryET60. cryET61, cryET62, cryET63, cryET64, cryET66. cryET67, cryET68, 
cryET72. cryET73, and cryET83 have been assigned to the novel nucleic acid sequence which 
encodes these polypeptides, respectively. Formal assignment of gene and protein designations 
based on the revised nomenclature of crystal protein endotoxins will be assigned by a committee 
on the nomenclature of B. thuringiensis, formed to systematically classify B. thuringiensis 
crystal proteins. The inventors contemplate that the arbitrarily assigned designations of the 
present invention will be superceded by the official nomenclature assigned to these sequences. 
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5.8 Transformed Host Cells and Transgenic Plants 

Methods and compositions for transfonning a bacterium, a yeast cell, a plant cell, or an 
entire plant with one or more expression vectors comprising a crystal protein-encoding gene 
segment are further aspects of this disclosure. A transgenic bacterium, yeast cell, plant cell or 
plant derived from such a transformation process or the progeny and seeds from such a 
transgenic plant are also further embodiments of the invention. 

Means for transforming bacteria and yeast cells are well known in the art. Typically, 
means of transformation are similar to those well known means used to transform other bacteria 
or yeast such as E. coli or Saccharomyces cerevisiae. Methods for DNA transformation of plant 
cells include Agrobacterium-mediated plant transformation, protoplast transformation, gene 
transfer into pollen, injection into reproductive organs, injection into immature embryos and 
particle bombardment. Each of these methods has distinct advantages and disadvantages. Thus, 
one particular method of introducing genes into a particular plant strain may not necessarily be 
the most effective for another plant strain, but it is well known which methods are useful for a 
particular plant strain. 

There are many methods for introducing transforming DNA segments into cells, but not 
all are suitable for delivering DNA to plant cells. Suitable methods are believed to include 
virtually any method by which DNA can be introduced into a cell, such as by Agrobacterium 
infection, direct delivery of DNA such as, for example, by PEG-mediated transformation of 
protoplasts (Omirulleh et al, 1993), by desiccation/inhibition-mediated DNA uptake, by 
electroporation, by agitation with silicon carbide fibers, by acceleration of DNA coated particles, 
etc. In certain embodiments, acceleration methods are preferred and include, for example, 
microprojectile bombardment and the like. 

Technology for introduction of DNA into cells is well-known to those of skill in the art. 
Four general methods for delivering a gene into cells have been described: (1) chemical methods 
(Graham and van der Eb, 1973; Zatloukal et al., 1992); (2) physical methods such as 
microinjection (Capecchi, 1980), electroporation (Wong and Neumann, 1982; Fromm et al., 
1985; U. S. Patent No. 5,384,253) and the gene gun (Johnston and Tang, 1994; Fynan et al., 
1993); (3) viral vectors (Clapp, 1993; Lu et al., 1993; Eglitis and Anderson, 1988a; 1988b); and 
(4) receptor-mediated mechanisms (Curiel etal., 1991; 1992; Wagner e/ a/., 1992). 
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5.8J Agrobacierium-Mediattd Transfer 

Agrobacterium-mcdiated transfer is a widely applicable system for introducing genes into 
plant cells because the DNA can be introduced into whole plant tissues, thereby bypassing the 
need for regeneration of an intact plant from a protoplast. The use of Agrobacterium-mediated 

5 plant integrating vectors to introduce DNA into plant cells is well known in the art. See, for 
example, the methods described (Fraley et al, 1985; Rogers et al., 1987). Further, the 
integration of the Ti-DNA is a relatively precise process resulting in few rearrangements. The 
region of DNA to be transferred is defined by the border sequences, and intervening DNA is 
usually inserted into the plant genome as described (Spielmaim ft al., 1986; Jorgensen et al., 

10 1987). 

Modem Agrobacterium transformation vectors are capable of replication in E. coli as 
well as Agrobacterium, allowing for convenient manipulations as described (Klee et al., 1985). 
Moreover, recent technological advances in vectors for Agrobacterium-med\aied gene transfer 
have improved the arrangement of genes and restriction sites in the vectors to facilitate 

15 construction of vectors capable of expressing various polypeptide coding genes. The vectors 
described (Rogers et al, 1987), have convenient multi-linker regions flanked by a promoter and 
a polyadenylation site for direct expression of inserted polypeptide coding genes and are suitable 
for present purposes. In addition, Agrobacterium containing both armed and disarmed Ti genes 
can be used for the transformations. In those plant strains where Agrobacterium-mediated 

20 transformation is efficient, it is the method of choice because of the facile and defined nature of 
the gene transfer. 

Agrobacterium-mediaied transformation of leaf disks and other tissues such as 
cotyledons and hypocotyls appears to be limited to plants that Agrobacterium naturally infects. 
Agrobacterium-mediated transformation is most efficient in dicotyledonous plants. Few 

25 monocots appear to be natural hosts for Agrobacterium, although transgenic plants have been 
produced in asparagus using Agrobacterium vectors as described (Bytebier et al., 1987). 
Therefore, commercially important cereal grains such as rice, com, and wheat must usually be 
transformed using alternative methods. However, as mentioned above, the transformation of 
asparagus using Agrobacterium can also be achieved (see, for example, Bytebier et al., 1987). 

30 A transgenic plant formed using Agrobacterium transformation methods typically 

contains a single gene on one chromosome. Such transgenic plants can be referred to as being 
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heterozygous for the added gene. However, inasmuch as use of the word "heterozygous" usually 
implies the presence of a complementary gene at the same locus of the second chromosome of a 
pair of chromosomes, and there is no such gene in a plant containing one added gene as here, it is 
believed that a more accurate name for such a plant is an independent segregant, because the 

5 added, exogenous gene segregates independently during mitosis and meiosis. 

More preferred is a transgenic plant that is homozygous for the added structural gene; 
a transgenic plant that contains two added genes, one gene at the same locus on each 
chromosome of a chromosome pair. A homozygous transgenic plant can be obtained by sexually 
mating (selfmg) an independent segregant transgenic plant that contains a single added gene, 

10 germinating some of the seed produced and analyzing the resulting plants produced for enhanced 
carboxylase activity relative to a control (native, non-transgenic) or an independent segregant 
transgenic plant. 

It is to be understood that two different transgenic plants can also be mated to 
produce offspring that contain two independently segregating added, exogenous genes. Selfing 
15 of appropriate progeny can produce plants that are homozygous for both added, exogenous genes 
that encode a polypeptide of interest. Back-crossing to a parental plant and out-crossing vsdth a 
non-transgenic plant are also contemplated. 

Transformation of plant protoplasts can be achieved using methods based on calcium 
phosphate precipitation, polyethylene glycol treatment, electroporation, and combinations of 
20 these treatments (see, e.g.. Potrykus et al., 1985; Loiz et al., 1985; Fromm el ai, 1985; Uchimiya 
etai, 1986; Callis etal, 1987; Marcotte et al, 1988). 

Application of these systems to different plant strains depends upon the ability to 
regenerate that particular plant strain from protoplasts. Illustrative methods for the regeneration 
of cereals from protoplasts are described (Fujimura et al., 1985; Toriyama et ai, 1986; Yamada 
25 et al. , 1 986; Abdullah et al. , 1 986). 

5.8.4 Other Transformation Methods 

Transformation of plant protoplasts can be achieved using methods based on calcium 
phosphate precipitation, polyethylene glycol treatment, electroporation, and combinations of 
these treatments (see, e.g., Potrykus et al, 1985; Lorz et al, 1985; Fromm et al, 1985; Uchimiya 
30 et al, 1986; Callis et al, 1987; Marcotte et al, 1988). 
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Illustrative methods for the regeneration of cereals from protoplasts are described 
(Fujimura et al, 1985; Toriyama et aL, 1986; Yamada el al, 1986; Abdullah et al., 1986). 

5.8.5 Gene Expression in Plants 

Although great progress has been made in recent years with respect to preparation of 
transgenic plants which express bacterial proteins such as B. thuringiensis crystal proteins, the 
results of expressing native bacterial genes in plants are often disappointing. In recent years, 
however, several potential factors have been implicated as responsible in varying degrees for the 
level of protein expression from a particular coding sequence. For example, scientists now know 
that maintaining a significant level of a particular mRNA in the cell is indeed a critical factor. 
Unfortunately, the causes for low steady state levels of mRNA encoding foreign proteins are 
many. First, full length RNA synthesis may not occur at a high frequency. This could, for 
example, be caused by the premature termination of RNA during transcription or due to 
imexpected mRNA processing during transcription. Second, full length RNA may be produced 
in the plant cell, but then processed (splicing, polyA addition) in the nucleus in a fashion that 
creates a nonfunctional mRNA. If the RNA is not properly synthesized, terminated and 
polyadenylated, it cannot move to the cytoplasm for translation. Similarly, in the cytoplasm, if 
mRNAs have reduced half lives (which are determined by their primary or secondary sequence) 
inisufficient protein product will be produced. In addition, there is an effect, whose magnitude is 
uncertain, of translational efficiency on mRNA half-life. In addition, every RNA molecule folds 
into a particular structure, or perhaps family of structures, which is determined by its sequence. 
The particular structure of any RNA might lead to greater or lesser stability in the cytoplasm. 
Structure per se is probably also a determinant of mRNA processing in the nucleus. It is likely 
that dramatically changing the sequence of an RNA will have a large effect on its folded 
structure It is likely that structure per se or particular structural features also have a role in 
determining RNA stability. 

To overcome these limitations in foreign gene expression, researchers have identified 
particular sequences and signals in RNAs that have the potential for having a specific effect on 
RNA stability. In certain embodiments of the invention, therefore, there is a desire to optimize 
expression of the disclosed nucleic acid segments in planta. One particular method of doing so, 
is by alteration of the bacterial gene to remove sequences or motifs which decrease expression in 
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a transfonned plant cell. The process of engineering a coding sequence for optimal expression in 
planta is often referred to as "plantizing" a DNA sequence. 

Particularly problematic sequences are those which are A+T rich. Unfortunately, since 
B. thuringiensis has an A+T rich genome, native crystal protein gene sequences must often be 
modified for optimal expression in a plant. The sequence motif ATTTA (or AUUUA as it 
appears in RNA) has been implicated as a destabilizing sequence in mammalian cell mRNA 
(Shaw and Kamen, 1986). Many short lived mRNAs have A+T rich 3' untranslated regions, and 
these regions often have the ATTTA sequence, sometimes present in multiple copies or as 
muhimers (e.^., ATTTATTTA;..). Shaw and Kamen showed that the transfer of the 3' end of an 
unstable mRNA to a stable RNA (globin or VAl) decreased' the stable RNA's half life 
dramatically. They fiirther showed that a pentamer of ATTTA had a profound destabilizing 
effect on a stable message, and that this signal could exert its effect whether it was located at the 
3' end or within the coding sequence. However, the number of ATTTA sequences and/or the 
sequence context in which they occur also appear to be important in determining whether they 
ftinction as destabilizing sequences. Shaw and Kamen showed that a trimer of ATTTA had 
much less effect than a pentamer on mRNA stability and a dimer or a monomer had no effect on 
stability (Shaw and Kamen, 1987). Note that multimers of ATTTA such as a pentamer 
automatically create an A+T rich region. This was shown to be a cytoplasmic effect, not nuclear. 
In other unstable mRNAs, the ATTTA sequence may be present in only a single copy, but it is 
often contained in an A+T rich region. From the animal cell data collected to date, it appears 
that ATTTA at least in some contexts is important in stability, but it is not yet possible to predict 
which occurrences of ATTTA are destabiling elements or whether any of these effects are likely 
to be seen in plants. 

Some studies on mRNA degradation in animal cells also indicate that RNA degradation 
may begin in some cases with nucleolytic attack in A+T rich regions. It is not clear if these 
cleavages occur at ATTTA sequences. There are also examples of mRNAs that have differential 
stability depending on the cell type in which they are expressed or on the stage within the cell 
cycle at which they are expressed. For example, histone mRNAs are stable during DNA 
synthesis but unstable if DNA synthesis is disrupted. The 3' end of some histone mRNAs seems 
to be responsible for this effect (Pandey and Marzlufif, 1987). It does not appear to be mediated 
by ATTTA, nor is it clear what controls the differential stability of this mRNA. Another 
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example is the differential stability of IgG mRNA in B lymphocytes during B cell maturation 
(Genovese and Milcarek, 1988). A final example is the instability of a mutant p-thallesemic 
globin mRNA. In bone marrow cells, where this gene is normally expressed, the mutant mRNA 
is unstable, while the wild-type mRNA is stable. When the mutant gene is expressed in HeLa or 
5 L cells in vitro, the mutant mRNA shows no instability (Lim et ai, 1988). These examples all 
provide evidence that mRNA stability can be mediated by cell type or cell cycle specific factors. 
Furthermore this type of instability is not yet associated with specific sequences. Given these 
uncertainties, it is not possible to predict which RNAs are likely to be unstable in a given cell. In 
addition, even the ATI l A- motif may act differentially depending on the nature of the cell in 
10 which the RNA is present. Shaw and Kamen (1987) have reported that activation of protein 
kinase C can block degradation mediated by ATTTA. 

The addition of a poly adenylate string to the 3' end is common to most eukaryotic 
mRNAs, both plant and animal. The currently accepted view of polyA addition is that the 
nascent transcript extends beyond the mature 3' terminus. Contained within this transcript are 
15 signals for polyadenylation and proper 3' end formation. This processing at the 3' end involves 
cleavage of the mRNA and addition of polyA to the mature 3' end. By searching for consensus 
sequences near the polyA tract in both plant and animal mRNAs, it has been possible to identify 
consensus sequences that apparently are involved in polyA addition and 3' end cleavage. The 
same consensus sequences seem to be important to both of these processes. These signals are 
20 typically a variation on the sequence AATAAA. In animal cells, some variants of this sequence O 
that are functional have been identified; in plant cells there seems to be an extended range of 
functional sequences (Wickens and Stephenson, 1984; Dean et al, 1986). Because all of these 
consensus sequences are variations on AATAAA, they all ar« A+T rich sequences. This 
sequence is typically found 15 to 20 bp before the polyA tract in a mature mRNA. Studies in 
25 animal cells indicate that this sequence is involved in both polyA addition and 3' maturation. 
Site directed mutations in this sequence can disrupt these functions (Conway and Wickens, 1988; 
Wickens et al, 1987). However, it has also been observed that sequences up to 50 to 100 bp 3' 
to the putative polyA signal are also required; i.e., a gene that has a normal AATAAA but has 
been replaced or disrupted downstream does not get properly polyadenylated (Gil and Proudfoot, 
30 1984; Sadofsky and Alwine, 1 984; McDevitt et al., 1984). That is, the polyA signal itself is not 
sufficient for complete and proper processing. It is not yet known what specific dov^mstream 
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sequences are required in addition to the polyA signal, or if there is a specific sequence that has 
this function. Therefore, sequence analysis can only identify potential polyA signals. 

In naturally occurring mRNAs that are normally polyadenylated, it has been observed 
that disruption of this process, either by altering the polyA signal or other sequences in the 
mRNA, profound effects can be obtained in the level of functional mRNA. This has been 
observed in several naturally occurring mRNAs, with results that are gene-specific so far. 

It has been shown that in natural mRNAs proper polyadenylation is important in mRNA 
accumulation, and that disruption of this process can effect mRNA levels significantly. 
However, insufficient knowledge exists to predict the effect of changes in a normal gene. In a 
heterologous gene, it is even harder to predict the consequences, however, it is possible that the 
putative sites identified are dysfunctional. That is, these sites may not act as proper polyA sites, 
but instead function as aberrant sites that give rise to unstable mRNAs. 

In animal cell systems, AATAAA is by far the most common signal identified in mRNAs 
upstream of the polyA, but at least four variants have also been found (Wickens and Stephenson, 
1984). In plants, not nearly so much analysis has been done, but it is clear that multiple 
sequences similar to AATAAA can be used. The plant sites in Table 2 called major or minor 
refer only to the study of Dean et al. (1986) which analyzed only three types of plant gene. The 
designation of polyadenylation sites as major or minor refers only to the frequency of their 
occurrence as functional sites in naturally occurring genes that have been analyzed. In the case 
of plants this is a very limited database. It is hard to predict with any certainty that a site 
designated major or minor is more or less likely to function partially or completely when found 
in a heterologous gene such as those encoding the crystal proteins of the present invention. 



Table 2 - Polyadenylation SrrES in Plant Genes 
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The present invention provides a method for preparing synthetic plant genes which genes 
express their protein product at levels significantly higher than the wild-type genes which were 
commonly employed in plant transformation heretofore. In another aspect, the present invention 
also provides novel synthetic plant genes which encode non-plant proteins. 

5 As described above, the expression of native B. thuringiensis genes in plants is often 

problematic. The nature of the coding sequences of B. thuringiensis genes distinguishes them 
from plant genes as well as many other heterologous genes expressed in plants. In particular, 
B. thuringiensis genes are very rich (-62%) in adenine (A) and thymine (T) while plant genes 
and most other bacterial genes which have been expressed in plants are on the order of 45-55% 

10 A+T. 

Due to the degeneracy of the genetic code and the limited number of codon choices for 
any amino acid, most of the "excess" A+T of the structural coding sequences of some Bacillus 
species are found in the third position of the codons. That is, genes of some Bacillus species 
have A or T as the third nucleotide in many codons. Thus A+T content in part can determine 

15 codon usage bias. In addition, it is clear that genes evolve for maximum function in the 
organism in which they evolve. This means that particular nucleotide sequences found in a gene 
from one organism, where they may play no role except to code for a particular stretch of amino 
acids, have the potential to be recognized as gene control elements in another organism (such as 
transcriptional promoters or terminators, polyA addition sites, intron splice sites, or specific 

20 mRNA degradation signals). It is perhaps surprising that such misread signals are not a more 
common feature of heterologous gene expression, but this can be explained in part by the 
relatively homogeneous A+T content (~50%) of many organisms. This A+T content plus the 
nature of the genetic code put clear constraints on the likelihood of occurrence of any particular 
oligonucleotide sequence. Thus, a gene from £ coli with a 50% A+T content is much less likely 

25 to contain any particular A+T rich segment than a gene from B. thuringiensis. 

Typically, to obtain high-level expression of the S-endotoxin genes in plants, existing 
structural coding sequence ("structural gene") which codes for the S-endotoxin are modified by 
removal of ATTTA sequences and putative polyadenylation signals by site directed mutagenesis 
of the DNA comprising the structural gene. It is most preferred that substantially all the 

30 polyadenylation signals and ATTTA sequences are removed although enhanced expression 
levels are observed wdth only partial removal of either of the above identified sequences. 
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Altemately if a synthetic gene is prepared which codes for the expression of the subject protein, 
codons are selected to avoid the ATTTA sequence and putative polyadenylation signals. For 
purposes of the present invention putative polyadenylation signals include, but are not 
necessarily limited to. AATAAA. AATAAT. AACCAA, ATATAA. AATCAA. ATACTA. 
5 ATAAAA. ATGAAA. AAGCAT. ATTAAT, ATACAT, AAAATA, ATTAAA, AATTAA, 
AATACA and CATAAA. In replacing the ATTTA sequences and polyadenylation signals, 
codons are preferably utilized which avoid the codons which are rarely found in plant genomes. 

The selected DNA sequence is scanned to identify regions with greater than four 
consecutive adenine (A) or thymine (T) nucleotides. The A+T regions are scamied for potential 
,0 plant polyadenylation signals. Although the absence of five or more consecutive A or T 
nucleotides eliminates most plant polyadenylation signals, if there are more than one of the 
minor polyadenylation signals identified within ten nucleotides of each other, then the nucleotide 
sequence of this region is preferably altered to remove these signals while maintaining the 
original encoded amino acid sequence. 
,5 The second step is to consider the about 15 to about 30 or so nucleotide residues 

surrounding the A+T rich region identified in step one. If the A+T content of the surrounding 
region is less than 80%, the region should be examined for polyadenylation signals. Alteration 
of the region based on polyadenylation signals is dependent upon (1) the number of 
polyadenylation signals present and (2) presence of a major plant polyadenylation signal. 
20 The extended region is examined for the presence of plant polyadenylation signals. The 

polyadenylation signals are removed by site-directed mutagenesis of the DNA sequence. The 
extended region is also examined for multiple copies of the ATTTA sequence which are also 

removed by mutagenesis. 

It is also preferred that regions comprising many consecutive A+T bases or G+C bases 
25 are disrupted since these regions are predicted to have a higher likelihood to form hairpin 
structure due to self-complementarity. Therefore, insertion of heterogeneous base pairs would 
reduce the likelihood of self-complementary secondary structure formation which are known to 
inhibit transcription and/or translation in some organisms. In most cases, the adverse effects may 
be minimized by using sequences which do not contain more than five consecutive A+T or G+C. 
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5.8.6 Synthetic Oligonucleotides for Mutagenesis 

When oligonucleotides are used in the mutagenesis, it is desirable to maintain the proper 
amino acid sequence and reading frame, without introducing common restriction sites such as 
BglU, HindUl, Sad, Kpn\, EcoJCL, Ncol, Pstl and SaR into the modified gene. These restriction 
5 sites are found in poly-linker insertion sites of many cloning vectors. Of course, the introduction 
of new polyadenylation signals, ATTTA sequences or consecutive stretches of more than five 
A+T or G+C, should also be avoided. The preferred size for the oligonucleotides is about 40 to 
about 50 bases, but fragments ranging from about 18 to about 100 bases have been utilized. In 
most cases, a minimum of about 5 to about 8 base pairs of homology to the template DNA on 

10 both ends of the synthesized fragment are maintained to insure proper hybridization of the primer 
to the template. The oligonucleotides should avoid sequences longer than five base pairs A+T or 
G+C. Codons used in the replacement of wild-type codons should preferably avoid the TA or 
CG doublet wherever possible. Codons are selected from a plant preferred codon table (such as 
Table 3 below) so as to avoid codons which are rarely found in plant genomes, and efforts should 

15 be made to select codons to preferably adjust the G+C content to about 50%. 

Regions with many consecutive A+T bases or G+C bases are predicted to have a higher 
likelihood to form hairpin structures due to self-complementarity. Disruption of these regions by 
the insertion of heterogeneous base pairs is preferred and should reduce the likelihood of the 
formation of self-complementary secondary structures such as hairpins which are known in some 

20 organisms to inhibit transcription (transcriptional terminators) and translation (attenuators). 1 
Alternatively, a completely synthetic gene for a given amino acid sequence can be 
prepared, with regions of five or more consecutive A+T or G+C nucleotides being avoided. 
Codons are selected avoiding the TA and CG doublets in codons whenever possible. Codon 
usage can be normalized against a plant preferred codon usage table (such as Table 3) and the 

25 G+C content preferably adjusted to about 50%. The resulting sequence should be examined to 
ensure that there are minimal putative plant polyadenylation signals and ATTTA sequences. 



BNSDOCID: <WO, 



0119859A2J_> 



wo 01/19859 



PCTAJSOO/25361 



-44- 

Table 3 - Preferred Codon Usage in Plants 
Amino Codon Percent Usage in Plants Amino Codon Percent Usage in Plants 
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Restriction sites found in commonly used cloning vectors are also preferably avoided. 
However, placement of several unique restriction sites throughout the gene is useful for analysis 
of gene expression or construction of gene variants. 

5 5.8.7 "Plantized" Gene Constructs 

The expression of a plant gene which exists in double-stranded DNA form involves 
transcription of messenger RNA (mRNA) from one strand of the DNA by RNA polymerase 
enzyme, and the subsequent processing of the mRNA primary transcript inside the nucleus. This 
processing involves a 3' non-translated region which adds polyadenylate nucleotides to the 3' 
0 end of the RNA. Transcription of DNA into mRNA is regulated by a region of DNA usually 
referred to as the "promoter." The promoter region contains a sequence of bases that signals 
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RNA polymerase to associate with the DNA and to initiate the transcription of mRNA using one 
of the DNA strands as a template to make a corresponding strand of RNA. 

A number of promoters which are active in plant cells have been described in the 
literature. These include the nopaline synthase (NOS) and octopine synthase (OCS) promoters 
5 (which are carried on tumor-inducing plasmids of A. tumefaciens), the Cauliflower Mosaic Virus 
(CaMV) 19S and 35 S promoters, the light-inducible promoter from the small subunit of ribulose 
bis-phosphate carboxylase (ssRUBISCO, a very abundant plant polypeptide) and the mannopine 
synthase (MAS) promoter (Velten et al, 1984; Velten and Schell, 1985). All of these promoters 
have been used to create various types of DNA constructs which have been expressed in plants 

10 (see e.g., Intl. Pat. Appl. Publ. Ser. No. WO 84/02913). 

Promoters which are known or are found to cause transcription of RNA in plant cells can 
be used in the present invention. Such promoters may be obtained from plants or plant viruses 
and include, but are not limited to, the CaMV35S promoter and promoters isolated from plant 
genes such as ssRUBISCO genes. As described below, it is preferred that the particular.. 

IS promoter selected should be capable of causing sufHcient expression to result in the production 
of an effective amount of protein. 

The promoters used in the DNA constructs (i.e. chimeric plant genes) of the present 
invention may be modified, if desired, to affect their control characteristics. For example, the 
CaMV35S promoter may be ligated to the portion of the ssRUBISCO gene that represses the 

20 expression of ssRUBISCO in the absence of light, to create a promoter which is active in leaves 
but not in roots. The resulting chimeric promoter may be used as described herein. For purposes 
of this description, the phrase "CaMV35S" promoter thus includes variations of CaMV35S 
promoter, e.g., promoters derived by means of ligation with operator regions, random or 
controlled mutagenesis, etc. Furthermore, the promoters may be altered to contain multiple 

25 "enhancer sequences" to assist in elevating gene expression. 

The RNA produced by a DNA construct of the present invention also contains a 5' non- 
translated leader sequence. This sequence can be derived from the promoter selected to express 
the gene, and can be specifically modified so as to increase translation of the mRNA. The 5* 
non-translated regions can also be obtained from viral RNA's, from suitable eukaryotic genes, or 

30 from a synthetic gene sequence. The present invention is not limited to constructs, as presented 
in the following examples, Rather, the non-translated leader sequence can be part of the 5' end 
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of the non-translated region of the coding sequence for the virus coat protein, or part of the 
promoter sequence, or can be derived from an unrelated promoter or coding sequence. In any 
case, it is preferred that the sequence flanking the initiation site conform to the translational 
consensus sequence rules for enhanced translation initiation reported by Kozak (1984). 

The cry DNA constructs of the present invention may also contain one or more modified 
or fully-synthetic structural coding sequences which have been changed to enhance the 
performance of the cry gene in plants. The structural genes of the present invention may 
optionally encode a fusion protein comprising an amino-terminal chloroplast transit peptide or 
secretory signal sequence. 

The DNA construct also contains a 3' non-translated region. The 3' non-translated region 
contains a polyadenylation signal which functions in plants to cause the addition of 
polyadenylate nucleotides to the 3' end of the viral RNA. Examples of suitable 3' regions are (1) 
the 3' transcribed, non-translated regions containing the polyadenylation signal of Agrobacterium 
tumor-inducing (Ti) plasmid genes, such as the nopaline synthase (NOS) gene, and (2) plant 
genes like the soybean storage protein (7S) genes and the small subunit of the RuBP carboxylase 
(E9) gene. 

5.9 Methods for Producing Insect-Resistant Transgenic Plants 

By transforming a suitable host cell, such as a plant cell, with a recombinant cryETSl, 
cryET40, cryET43. cryET44. cryET45, cryET46. cryET47, cryET49, cryETSl, cryET52. 
cryET53. cryET54, cryET56. cryETSl, cryEJ59, cryET60. cryET61, cryET62. cryET63. 
cryET64. cryET66, cryET67. cryET68. cryET72. cryET73. and cryET83 gene-containing 
segment, the expression of the encoded crystal protein (i.e., a bacterial crystal protein or 
polypeptide having insecticidal activity against coleopterans) can resuU in the formation of 
insect-resistant plants. 

By way of example, one may utilize an expression vector containing a coding region for a 
B. thuringiensis crystal protein and an appropriate selectable marker to transform a suspension of 
embryonic plant cells, such as wheat or com cells using a method such as particle bombardment 
(Maddock et a/., 1991; Vasil et al, 1992) to deliver the DNA coated on microprojectiles into the 
recipient cells. Transgenic plants are then regenerated from transformed embryonic calli that 
express the insecticidal proteins. 
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The formation of transgenic plants may also be accomplished using other methods of cell 
transformation which are known in the art such as Agrobacierium-mediated DNA transfer 
(Fraley et al, 1983). Alternatively, DNA can be introduced into plants by direct DNA transfer 
into pollen (Zhou et al., 1983; Hess, 1987; Luo et al., 1988), by injection of the DNA into 
5 reproductive organs of a plant (Pena et al., 1987), or by direct injection of DNA into the cells of 
immature embryos followed by the rehydration of desiccated embryos G^euhaus et al., 1987; 
Benbrooke/a/., 1986). 

The regeneration, development, and cultivation of plants from single plant protoplast 
transformants or from various transformed explants is well known in the art (Weissbach and 

10 Weissbach, 1988). This regeneration and: growth process typically includes the steps of selection 
of transformed cells, culturing those individualized cells through the usual stages of embryonic 
development through the rooted plantlet stage. Transgenic embryos and seeds are similarly 
regenerated. The resulting transgenic rooted shoots are thereafter planted in an appropriate plant 
growth medium such as soil. 

IS The development or regeneration of plants containing the foreign, exogenous gene that 

encodes a polypeptide of interest introduced by Agrobacterium from leaf explants can be 
achieved by methods well known in the art such as described (Horsch et al., 1985). In this 
procedure, transformants are cultured in the presence of a selection agent and in a medium that 
induces the regeneration of shoots in the plant strain being transformed as described (Fraley et 

20 al., 1983). 

This procedure typically produces shoots within two to four months and those shoots are 
then transferred to an appropriate root-inducing medium containing the selective agent and an 
antibiotic to prevent bacterial growth. Shoots that rooted in the presence of the selective agent to 
form plantlets are then transplanted to soil or other media to allow the production of roots. These 
25 procedures vary depending upon the particular plant strain employed, such variations being well 
known in the art. 

Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic 
plants, as discussed before. Otherwise, pollen obtained from the regenerated plants is crossed to 
seed-grown plants of agronomically important, preferably inbred lines. Conversely, pollen from 
30 plants of those important lines is used to pollinate regenerated plants. A transgenic plant of the 
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present invention containing a desired polypeptide is cultivated using methods well known to 
one skilled in the art. 

A transgenic plant of this invention thus has an increased amount of a coding region (e.g., 
a cryETSl. cryET40, cryET43, cryET44, cryET45. cryET46. cryET47. cryET49, cryETSl, 

5 cryET52. cryET53. cryET54, cryET56. cryET57, cryET59, cryET60. cryET61, cryET62. 
cryET63, cryET64, cryET66. cryET67. cryET68. cryET72. cryET73. and cryET83 gene) that 
encodes one or more CiyET31. CryET40, CryET43, CryET44, CryET45, CryET46, CryET47, 
CiyET49, CryETSl, CryET52, CiyET53, CiyET54, CryET56, CryET57, CryET59, CryET60, 
CiyET61, CryET62. CryET63, CiyET64, CryET66, CryET67, CryET68, CryET72, CryET73. 

10 and CryET83 polypeptides. A preferred transgenic plant is an independent segregant and can 
transmit that gene and its activity to its progeny. A more preferred transgenic plant is 
homozygous for that gene, and transmits that gene to all of its offspring on sexual mating. Seed 
from a transgenic plant may be grown in the field or greenhouse, and resulting sexually mature 
transgenic plants are self-pollinated to generate true breeding plants. The progeny from these 

15 plants become true breeding lines that are evaluated for, by way of example, increased 
insecticidal capacity against coleopteran insects, preferably in the field, under a range of 
environmental conditions. The inventors contemplate that the present invention will find 
particular utility in the creation of transgenic plants of commercial interest including various turf 
grasses, wheat, com, rice, barley, oats, a variety of ornamental plants and vegetables, as well as a 

20 number of nut- and fruit-bearing trees and plants. 

5.10 Definitions 

The following words and phrases have the meanings set forth below. 
Expression: The combination of intracellular processes, including transcription and 
translation undergone by a coding DNA molecule such as a structural gene to produce a 
25 polypeptide. 

Identity or percent identity: refers to the degree of similarity between two nucleic acid 
or protein sequences. An alignment of the two sequences is performed by a suitable computer 
program. A widely used and accepted computer program for performing sequence alignments is 
CLUSTALW vl.6 (Thompson, et al. Nucl. Acids Res., 22: 4673-4680, 1994). The number of 
30 matching bases or amino acids is divided by the total number of bases or amino acids, and 
multiplied by 100 to obtain a percent identity. For example, if two 580 base pair sequences had 
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145 matched bases, they would be 25 percent identical. If the two compared sequences are of 
different lengths, the number of matches is divided by the shorter of the two lengths. For 
example, if there were 100 matched amino acids between 200 and a 400 amino acid proteins, 
they are 50 percent identical with respect to the shorter sequence. If the shorter sequence is less 
s than 150 bases or 50 amino acids in length, the number of matches are divided by 150 (for 
nucleic acid bases) or 50 (for amino acids), and multiplied by 100 to obtain a f)ercent identity. 

Promoter: A recognition site on a DNA sequence or group of DNA sequences that 
provide an expression control element for a structural gene and to which RNA polymerase 
specifically binds and initiates RNA synthesis (transcription) of that gene. 
10 Regeneration:The process of growing a plant from a plant cell (e.g., plant protoplast or 

explant). 

Structural gene: A polynucleotide sequence that encodes a polypeptide, that is 
expressed to produce a polypeptide, or which is cryptic or incapable of expression in its natural 
host cell but which can be isolated and purified and operably linked to at least a promoter 

IS functional in one or more host cell types to express the encoded polypeptide. 

Transformation: A process of introducing an exogenous DNA sequence (e.g., a 
vector, a recombinant DNA molecule) into a cell or protoplast in which that exogenous DNA is 
incorporated into a chromosome or is capable of autonomous replication. 

Transformed cell: A cell whose DNA has been altered by the introduction of an 

20 exogenous DNA molecule into that cell. 

Transgenic cell: Any cell derived or regenerated from a transformed cell or derived 
from a transgenic cell. Exemplairy transgenic cells include plant calli derived from a transformed 
plant cell and particular cells such as leaf, root, stem, e.g., somatic cells, or reproductive (germ) 
cells obtained from a transgenic plant. 

25 Transgenic plant: A plant or progeny thereof derived from a transformed plant cell or 

protoplast, wherein the plant DNA contains an introduced exogenous DNA molecule not 
originally present in a native, non-transgenic plant of the same strain. The terms "transgenic 
plant" and "transformed plant" have sometimes been used in the art as synonymous terms to 
define a plant whose DNA contains an exogenous DNA molecule. However, it is thought more 

30 scientifically correct to refer to a regenerated plant or callus obtained from a transformed plant 
cell or protoplast as being a transgenic plant, and that usage will be followed herein. 
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Vector: A DNA molecule capable of replication in a host cell and/or to which 
another DNA segment can be operatively linked so as to bring about replication of the attached 
segment. A plasmid is an exemplary vector. 
S.ll Isolating Homologous Gene and Gene Fragments 

The genes and 6-endotoxins according to the subject invention include not only the full 
length sequences disclosed herein but also fragments of these sequences, or fusion proteins, 
which retain the characteristic insecticidal activity of the sequences specifically exemplified 
herein. 

It should be apparent to a person skill in this art that insecticidal 5-endotoxins can be 
identified and obtained through several means. The specific genes, or portions thereof, may be 
obtained from a culture depository, or constructed symhetically, for example, by use of a gene 
machine. Variations of these genes may be readily constructed using standard techniques for 
making point mutations. Also, fragments of these genes can be made using commercially 
available exonucleases or endonucleases according to standard procedures. For example, 
15 enzymes such as Bal3l or site-directed mutagenesis can be used to systematically cut off 
nucleotides from the ends of these genes. Also, genes which code for active fragments may be 
obtained using a variety of other restriction enzymes. Proteases may be used to directly obtain 
active fragments of these 5-endotoxins. 

Equivalent 6-endotoxins and/or genes encoding these equivalent 5-endotoxins can also be 
20 isolated from Bacillus strains and/or DNA libraries using the teachings provided herein. For 
example, antibodies to the 5-endotoxins disclosed and claimed herein can be used to identify and 
isolate other 5-endotoxins from a mixture of proteins. Specifically, antibodies may be raised to 
the portions of the 5-endotoxins which are most constant and most distinct from other B. 
thuringiensis 5-endotoxins. These antibodies can then be used to specifically identify equivalent 
25 6-endotoxins with the characteristic insecticidal activity by immunoprecipitation, enzyme linked 
immunoassay (ELISA), or Western blotting. 

A further method for identifying the 5-endotoxins and genes of the subject invention is 
through the use of oligonucleotide probes. These probes are nucleotide sequences having a 
detectable label. As is well known in the art, if the probe molecule and nucleic acid sample 
30 hybridize by forming a strong bond between the two molecules, it can be reasonably assumed 
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that the probe and sample are essentially identical. The probe's detectable label provides a 
means for determining in a known manner whether hybridization has occurred. Such a probe 
analysis provides a rapid method for identifying formicidal 5-endotoxin genes of the subject 
invention. 

5 The nucleotide segments which are used as probes according to the invention can be 

synthesized by use of DNA synthesizers using standard procedures. In the use of the nucleotide 
segments as probes, the particular probe is labeled with any suitable label known to those skilled 
in the art, including radioactive and non-radioactive labels. Typical radioactive labels include 
'^P, '^^I, ^'S, or the like. A probe labeled with a radioactive isotope can be constructed from a 

10 nucleotide sequence complementary to the DNA sample by a conventional nick translation 
reaction, using a DNase and DNA polymerase. The probe and sample can then be combined in a 
hybridization buffer solution and held at an appropriate temperature until annealing occurs. 
Thereafter, the membrane is washed free of extraneous materials, leaving the sample and bound 
probe molecules typically detected and quantifred by autoradiography and/or liquid scintillation 

15 counting. 

Non-radioactive labels include, for example, ligands such as biotin or thyroxine, as well 
as enzymes such as hydrolases or peroxidases, or the various chemiluminescers such as luciferin, 
or fluorescent compounds like fluorescein and its derivatives. The probe may also be labeled at 
both ends with different types of labels for ease of separation, as, for example, by using an 

20 isotopic label at the end mentioned above and a biotin label at the other end. 

Duplex formation and stability depend on substantial complementarity between the two 
strands of a hybrid, and, as noted above, a certain degree of mismatch can be tolerated. 
Therefore, the probes of the subject invention include mutations (both single and multiple), 
deletions, insertions of the described sequences, and combinations thereof, wherein said 

25 mutations, insertions and deletions permit formation of stable hybrids with the target 
polynucleotide of interest. Mutations, insertions, and deletions can be produced in a given 
polynucleotide sequence in many ways, by methods currently known to an ordinarily skilled 
artisan, and perhaps by other methods which may become known in the ftiture. 

The potential variations in the probes listed is due, in part, to the redundancy of the 

30 genetic code. Because of the redundancy of the genetic code, i.e., more than one coding 
nucleotide triplet (codon) can be used for most of the amino acids used to make proteins. 



wo 01/19859 



PCT/USOO/25361 



52 



Therefore different nucleotide sequences can code for a particular amino acid. Thus, the amino 
acid sequences of the B. thuringiensis S-endotoxins and peptides can be prepared by equivalent 
nucleotide sequences encoding the same amino acid sequence of the protein or peptide. 
Accordingly, the subject invention includes such equivalent nucleotide sequences. Also, inverse 
5 or complement sequences are an aspect of the subject invention and can be readily used by a 
person skilled in this art. In addition it has been shown that proteins of identified structure and 
function may be constructed by changing the amino acid sequence if such changes do not alter 
the protein secondary structure (Kaiser and Kezdy, 1984). Thus, the subject invention includes 
) mutants of the amino acid sequence depicted herein which do not alter the protein secondary 
,0 structure, or ifthe structure is altered..the biological activity is substantially retaine Further. 

the invention also includes mutants of organisms hosting all or part of a S-endotoxin encoding a 
gene of the invention. Such mutants can be made by techniques well known to persons skilled in 
the art. For example, UV irradiation can be used to prepare mutants of host organisms. 
Likewise, such mutants may include asporogenous host cells which also can be prepared by 
15 procedures well known in the art 

6.0 Examples 

The following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skill in the art that the techniques disclosed in the 
examples which follow represent techniques discovered by the inventor to function well in the 
20 practice of the invention, and thus can be considered to constitute preferred modes for its 
practice. However, those of skill in the art should, in light of the present disclosure, appreciate 
that many changes can be made in the specific embodiments which are disclosed and still obtain 
a like or similar result without departing fh)m the spirit and scope of the invention. 

6.1 Example 1 - Identification of A thuringiensis Strains Containing Novel 5 
25 -Endotoxins 

Wild-type B. thuringiensis strains containing novel insecticidal protein genes were 
identified by Southern blot hybridization studies employing specific DNA probes. Twenty-four 
unique cry genes were discovered that are related to B. thuringiensis genes in the cryl. cry2, or 
cry9 classes of toxin genes. 
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Various methods were employed to clone the novel genes and express them in a crystal 
protein-negative (Cry-) strain of 5. thuringiensis. These methods include PCR''"^' amplification of 
the region of cry7 -related genes that encodes the active portion of the toxin gene. The PCR"""" 
product is then joined to a fragment from the crylAc gene encoding the C-terminal region of the 
protoxin. This gene fusion was then expressed in a B. thuringiensis recombinant strain to 
produce a hybrid protoxin. In this instance, it is recognized that the sequence of the amplified 
DNA can be used to design hybridization probes to isolate the entire coding sequence of the 
novel cry gene from the wild-type B. thuringiensis strain. 

Wild-type B. thuringiensis strains were screened in a bioassay to identify strains that are 
toxic to larvae of lepidopteran insects (procedure described in Example 10). Active strains were 
then examined genetically to determine if they contain novel toxin genes. The method used to 
make this determination is described below and includes isolation of genomic DNA from the 
B. thuringiensis strain, restriction enzyme digestion. Southern blot hybridization, and analysis of 
the hybridizing restriction fragments to determine which genes are present in a strain. 

Total genomic DNA was extracted by the following procedure. Vegetative cells were 
resuspended in a lysis buffer containing 50 mM glucose, 25 mM Tris-HCl (pH 8.0), 10 mM 
EDTA, and 4 mg/ml lysozyme. The suspension was incubated at 37°C for 1 h. Following 
incubation, the suspension was extracted once with an equal volume of phenol, then once vsdth an 
equal volume of phenoI:chlorofonn:isoamyl alcohol (50:48:2), and once with an equal volume of 
chloroformrisoamyl alcohol (24:1). The DNA was precipitated from the aqueous phase by the 
addition of one-tenth volume 3 M sodium acetate and two volimies of 100% ethanol. The 
precipitated DNA was collected by centrifugation, washed with 70% ethanol and resuspended in 
distilled water. 

The DNA samples were digested with the restriction enzymes Clal and Pstl. The 
combination of these two enzymes give a digestion pattern of fragments that, when hybridized 
with the probe wd207 (described below), allows the identification of many of the known cryl- 
related toxin genes. Hybridizing fragments that did not correspond to the fragment sizes 
expected for the known genes were classified as unknown and were candidates for cloning and 
characterization. 

The digested DNA was size fractionated by electrophoresis through a 1 .0% agarose gel in 
IX TBE (0.089 M Tris-borate, 0.089 M boric acid, 0.002 M EDTA) overnight at 2 V/cm of gel 
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length. The fractionated DNA fragments were then transferred to a Millipore Immobilon-NC® 
nitrocellulose filter (Millipore Corp., Bedford, MA) according to the method of Southern (1975). 
The DNA fragments were fixed to the nitocellulose by baking the filter at SO^C in a vacuum 
oven. 

To identify the DNA fragment(s) containing the sequences related to cryl genes, the 
oligonucleotide wd207 was radioactively labeled at the 5' end and used as a hybridization probe. 
To radioactively label the probe, 1-5 pmoles of wd207 were added to a reaction (20 ul total 
volume) containing 3 ul [y-'^P]ATP (3,000 Ci/mmole at 10 mCi/ml), 70 mM Tris-HCl, pH 7.8, 
10 mM MgCh, 5 mM DTT, and 10 units T4 polynucleotide kinase (Promega Corp., Madison, 
WI). The reaction was incubated for 20 min at 37°C to allow the transfer of the radioactive 
phosphate to the 5'-end of the oligonucleotide, thus making it useful as a hybridization probe. 

The oligonucleotide probe used in this analysis, designated wd207, has the following 
sequence: 

5'-TGGATACTTGATCAATATGATAATCCGTCACATCTGTrnTA-3' (SEQ IDNO:5l) 
This oligonucleotide was designed to specifically hybridize to a conserved region of cryl 
genes downstream from the proteolyic activation site in the protoxin. Table 4 lists some of the 
B. thuringiensis toxin genes and their identities with wd207. The orientation of the wd207 
sequence is inverted and reversed relative to the coding sequences of the cry genes. 

Table 4 



cry Gene 


% Identity to wd207 


Nucleotide Position in CDS 


crylAa 


100% 


1903-1944 


crylBa 


95.2% 


1991-2032 


crylCa 


97.6% 


1930-1971 


crylDa 


97.6% 


1858-1899 


crylEa 


97.6% 


1885-1926 



The labeled probe was then incubated with the nitrocellulose filter overnight at 45°C in 
3X SSC (IX SSC = 0.15 M NaCl, 0.015 M sodium citrate), 0.1% SDS, lOX Denhardt's reagent 
(0.2% BSA, 0.2% polyvinylpyrrolidone, 0.2% FicoU), and 0.2 mg/ml heparin. Following this 
incubation period, the filter was washed in several changes of 3X SSC, 0.1% SDS at 45°C. The 
filter was blotted dry and exposed to Kodak X-OMAT AR X-ray film (Eastman Kodak Co., 
Rochester, NY) overnight at -70°C with an intensifying screen to obtain an autoradiogram. 
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The autoradiograms were analyzed to determine which wild-type B. thuringiensis strains 
contained cry J genes that could be novel. Since the probe was only 42 nucleotides, it is tmlikely 
that recognition sites for the restriction endonucleases Clal and Pstl would occur within the 
hybridizing region of the cry7-related genes. Therefore, it was assumed that each hybridizing 

5 restriction fragment represented one crv7 -related gene. The sizes, in kilobases (kb), of the 
hybridizing restriction fragments were determined based on the migration of the fragment in the 
agarose gel relative to DNA fragments of known size. The size of a fragment could be used to 
determine if that fragment represented a known cryj gene. For example, from the DNA 
sequence of the crylAc gene it was known that wd207 would hybridize to a 0.43 kb fragment 

10 after digestion of crylAc DNA with CM and Pstl. If the Southern blot analysis of a strain 
showed a 0.43 kb hybrizing fragment, that strain was assigned a probable genotype of crylAc. 
Fragments that could not be easily assigned a probable genotype were selected as candidates for 
further analysis. Because many cry^ -containing strains have more than one cryZ-related gene, 
all fragments were given a putative designation. 
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6.2 EXAMPLE 2 - IDENTIFICATION OF B. THURINGIENSJS STRAINS 
CONTAINING NOVEL CJ?y2-RELATED GENES 

Proteins encoded by the cry2 class of B. thuringiensis class of toxin genes have activity 
on the larvae of lepidopteran and diopteran insects. Southern blot hybridization analysis of DNA 

5 extracted from lepidopteran-active strains was utilized to identify novel cr>'2-related genes. 
Total genomic DNA was isolated as described in Section 6.1. The DNA was digested with the 
restriction endonuclease Sau3A and run on a 1 .2% agarose gel as described. The digested DNA 
was transferred to nitrocellulose filters to be probed vwth a DNA fragment containing the cry2Aa 
gene. Hybridizations were performed at 55°C and the filters washed and exposed to X-ray film 

10 to obtain an autoradiogram. ' 

SauSA digestion followed by hybridization with the cry2Aa gene gave characteristic 
patterns of hybridizing fragments allowing the identification of the cry2Aa. cry2Ab, and cry2Ac 
genes. Hybridizing fragments that differed from these patterns indicated the presence of a novel 
cr)/2-related gene in that strain. 

15 Once a strain was identified as containing one or more novel cry^-related genes, an 

additional Southern blot hybridization was performed. The procedures were the same as those 
already described above, except another restriction enzyme, usually //wdlll, was used. Since an 
enzyme like Mndlll (a "six base cutter") cuts DNA less frequently than does S'a«3A or Mbol, it 
was more likely to generate a restriction fragment containing the entire cr>'2-related gene which 

20 could then be readily cloned. 

63 Example 3 ~ Identification of A thuringiensis Strains Containing Novel ciyP-Tyipe 
Genes 

A cryP-specific oligonucleotide, designated pr56, was designed to facilitate the 
identification of strains harboring cryP-type genes. This oligonucleotide corresponds to 
25 nucleotides 4349-4416 of the gene (GenBank Accession No. Z37527). The sequence of pr56 
was as follows: 

5'-AGTAACGGTGTTACTATTAGCGAGGGCGGTCCATTCTTTAA 
AGGTCGTGCACTTCAGTTAGC-3' (SEQ ID NO:52). 

B. thuringiensis isolates were spotted or "patched" on SGNB plates, with no more than 50 
30 isolates per plate, and grown overnight at 25°C. The B. thuringiensis colonies were transferred 
to nitrocellulose filters and the filters placed, colony side up, on fresh SGNB plates for overnight 
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growth at SCC. Subsequently, the filters were placed, colony side up, on Whatman paper 
soaked in denaturing solution (1.5 M NaCl, 0.5 N NaOH) for 20 min. After denaturation. the 
filters were placed on Whatman paper soaked in neutralizing solution (3 M NaCl, 1.5 M Tris- 
HCl, pH 7.0) for 20 min. Finally, the filters were washed in 3X SSC (IX SSC = 0.15 M NaCl 
5 and 0.01 5 M sodium citrate) to remove cellular debris and baked in a vacuum oven at SO^C for 
90 min. 

The cryP-specific oligonucleotide pr56 (-10 pmoles) was end-labeled with [y-'^P]ATP 
using T4 polynucleotide kinase. The labeling reaction was carried out at 37°C for 20 min and 
terminated by incubating the reaction at 100 C for 3 min. After ethanol precipitation, the labeled 
10 oligonucleotide was resuspended in 1 00 jal distilled H2O. 

The filters were incubated with the cryP-specific probe in 6X SSC, lOX Denhardt's 
solution, 0.5% glycine, 0.2% SDS at 47°C overnight. The filters were washed twice in 3X SSC, 
0.1% SDS for 15 min at 47°C and twice in IX SSC, 0.1% SDS for 15 min at 47°C. The dried 
filters were exposed to X-OMAT XAR-5 film (Eastman Kodak Co.) at -70''C using an 
15 intensifying screen. The developed autoradiogram revealed 24 isolates of B.thuringiensis 
containing DNA that hybridized to the cry9 probe. 

To identify cryPC-type genes among these strains, two opposing oligonucleotide primers 
specific for the cry9C gene (GenBank Accession No. Z37527) were designed for polymerase 
chain reaction (PCR™) analyses. The sequence of pr58 is: 
20 5'-CGACTTCTCCTGCTAATGGAGG-3' (SEQ ID NO:53). 

The sequence of pr59 is: 

5'-CTCGCTAATAGTAACACCGTTACTTGCC-3' (SEQ ID NO:54). 

Plasmid DNAs were isolated from the isolates of B. thuringiensis believed to contain cryP-type 

genes. B. thuringiensis isolates were grown overnight at SO'^C on Luria agar plates and 2 

25 loopfiils of cells from each isolate were suspended in 50 mM glucose, 10 mM Tris-HCl, 1 mM 
EDTA (IX GTE) containing 4 mg/ml lysozyme. After a 1 0 min incubation at room temperature, 
plasmid DNAs were extracted using a standard alkaline lysis procedure (Maniatis et al, 1982). 
The plasmid DNAs were resuspended in 20 ^1 of IX TE (10 mM Tris-HCl, 1 mM EDTA, pH 
7.5). Two microliters of the plasmid DNA preparations were used in the PCR™ reactions. 

30 Amplifications were performed in 100 m1 volumes with a Perkin-Elmer DNA Thermocycler 
(Perkin-Elmer Cetus, Foster City, CA) using materials and methods provided in the Perkin- 
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Elmer GeneAmp™ kit. Conditions for the PGR™ were as follows: 95°C for 30 sec, 46''C for 30 
sec, 70°C for 1 min; 30 cycles. A PGR''^'^ using these primers and the cryPC gene as a template 
should yield a DNA fragment of ~970 bp. Of twenty-four strains found to hybridize to the cry9 
probe (SEQ ID NO:XX), only one strain, EG9290, yielded the predicted amplified DNA 
5 fragment. 

6.4 EXAMPLE 4 - CLONING OF B. THURJNGIENSIS TOXIN GENES BY 
CONSTRUCTING MBOl PARTIAL DIGEST LIBRARIES 

The restriction endonuclease Mbol was utilized in the construction of genomic DNA 
libraries because it has a recognition sequence of four base pairs which occurs frequently in long 

10 . stretches of DNA. Total genomic DNA was isolated from B. thuringiensis strains as described in 
Section 6.1. The DNA was digested under conditions allowing limited cleavage of a DNA 
strand. The method of establishing these conditions has been described (Maniatis et al, 1982). 
Digestion of DNA in this manner created a set of essentially randomly cleaved, overlapping 
fragments which were used to create a library representative of the entire genome. 

rs The digested DNA fragments were separated, according to size, by agarose gel 

electrophoresis through a 0.6% agarose, IX TEE gel, overnight at 2 volts/cm of gel length. The 
gel was stained with ethidium bromide so that the digested DNA could be visualized when 
exp)Osed to long-wave UV light. A razor blade was used to excise a gel slice containing DNA 
fragments of approximatfely 9- kb to 12-kb in size. The DNA fragments were removed from the 

20 agarose by placing the slice in a dialysis bag with enough TE (10 mM Tris-HCl, 1 mM EDTA) to 
cover the slice. The bag was then closed and placed in a horizontal electrophoresis apparatus 
filled with IX TBE buffer. The DNA was electroeluted from the slice into the TE at 100 volts 
for 2 h. The TE was removed from the bag, extracted with phenohchloroform (1:1), followed by 
extraction with chloroform. The DNA fragments are then collected by the standard technique of 

25 ethanol precipitation (see Maniatis e/ a/., 1982). 

To create a library in E. coli of the partially-digested DNA, the fragments were ligated 
into the shuttle vector, pHT315 (Arantes and Lereclus, 1991). This plasmid contains replication 
origins for E. coli and B. thuringiensis, genes for resistance to the antibiotics erythromycin and 
ampicillin, and a multiple cloning site. The Mbdl fragments were mixed with ^a/wHI-digested 

30 pHT31S that had been treated with calf intestinal, or bacterial, alkaline phosphatase (GibcoBRL, 
Gaithersburg, MD) to remove the 5 '-phosphates from the digested plasmid, preventing re- 
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ligation of the vector to itself. After purification. T4 ligase and a ligation buffer (Promega Cbrp., 
Madison, WI) were added to the reaction containing the digested vector and the Mbol fragments. 
These were incubated overnight at 15°C, or at room temperature for 1 h, to allow the insertion 
and ligation of the Mbol fragments into the pHT3 1 5 vector DNA. 
5 The ligation mixture was then introduced into transformation-competent E. coli SURE® 

cells (Stratagene Cloning Systems, La Jolla, CA), following procedures described by the 
manufacturer. The transformed E. coli cells were then plated on LB agar plates containing 50-75 
^g/ml ampicillin and incubated ovenight at 37°C. The growth of several hundred ampicillin- 
resistant colonies on each plate indicated the presence of recombinant plasmid in the cells of 

10 each of those colonies. 

To isolate the colonies harboring sequences encoding toxin genes, the colonies were first 
transferred to nitrocellulose filters. This was accomplished by simply placing a circular 
nitrocellulose filter (Millipore HATF 08525, Millipore Corp., Bedford, MA) directly on top of 
the LB-ampicillin agar plates containing the transformed colonies. When the filter was slowly 

15 peeled off of the plate the colonies stick to the filter giving an exact replica of the pattern of 
colonies from the original plate. Enough cells from each colony were left on the plate that 5 to 
6 h of growth at 37°C restored the colonies. The plates were then stored at 4°C until needed. 
The nitrocellulose filters with the transferred colonies are then placed, colony-side up, on fresh 
LB-ampicillin agar plates and allowed to grow at 37°C until they reached an approximate 1 mm 

20 diameter. 

To release the DNA from the recombinant E. coli cells the nitrocellulose filters were 
placed, colony-side up, on 2-sheets of Whatman 3MM chromatogrphy paper (Whatman 
International Ltd., Maidstone, England) soaked with 0.5 N NaOH. 1.5 M NaCl for 15 min. This 
treatment lysed the cells and denatured the released DNA allowing it to stick to the nitrocellulose 
25 filter. The filters were then neutralized by placing the filters, colony-side up, on 2 sheets of 
Whatman paper soaked with 1 M NH4-acetate, 0.02 M NaOH for 10 min. The filters were rinsed 
in 3X SSC, air dried, and baked for 1 h at 80°C in a vacuum oven. The filters were then ready 
for use in hybridization studies using probes to identify different classes of B. thuringiensis 
genes, as described in the above examples. 

In order to identify colonies containing cloned cryi-related genes, the cryi-specific 
oligonucleotide wd207 was labeled at the 5'-end with [y-^^PlATP and T4 polynucleotide kinase. 



30 
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The labeled probe was added to the filters in 3X SSC, 0.1% SDS, lOX Denhardt's reagent (0.2% 
BSA, 0.2% polyvinylpyrrolidone, 0.2% Ficoll), 0.2 mg/ml heparin and incubated overnight at 
47°C. These conditions allowed hybridization of the labeled oligonucleotide to related 
sequences present on the nitrocellulose blots of the transformed E. coli colonies. Following 

5 incubation the filters were washed in several changes of 3X SSC, 0.1% SDS at 45°C. The filters 
were blotted dry and exposed to Kodak X-OMAT AR X-ray film (Eastman Kodak Co., 
Rochester, NY) overnight at -70°C with an intensifying screen. 

Colonies that contain cloned cryi -related sequences were identified by aligning signals 
on the autoradiogram with the colonies on the original transformation plates. The isolated 

10 colonies were then grown in LB-ampicillin liquid medium from which the cells could ■ be 
harvested and recombinant plasmid prepared by the standard alkaline-lysis miniprep procedure 
(Maniatis et al. 1982). The plasmid DNA was then used as a template for DNA sequencing 
reactions necessary to confirm that the cloned gene was novel. If the cloned gene was novel, the 
plasmid was then introduced into a crystal protein-negative strain of B. thuringiensis {Cry) so 

IS that the encoded protein could be expressed and characterized. These procedure^ are described 
in detail in the follovtdng sections. 

6.5 Example 5 — Cloning of Specific Endonuclease Restriction Fragments 

The identification of a specific restriction fragment containing a novel B. thuringiensis 
gene has been described for crv2-related genes in Section 2. The procedure for cloning a 

20 restriction fragment of known size was essentially the same as described for cloning an Mbol 
fragment. The DNA was digested with a restriction enzyme {e.g., HindUl), and run through an 
agarose gel to separate the fragments by size. Fragments of the proper size, identified by 
Southern blot analysis (Example 2), were excised with a razor blade and electroeluted fi-om the 
gel slice into TE buffer from which they could be precipitated. The isolated restriction fi-agments 

25 were then ligated into an E. coli/B. thuringiensis shuttle vector and transformed into E. coli to 
construct a size-selected library. The library could then be hybridized with a specific gene 
probe, as described in Example 4, to isolate the colony containing the cloned novel gene. 

6.6 Example 6 ~ Cloning of PCR^'^-Amplified Fragments 

A rapid method for cloning and expressing novel cryl gene fi'agments fi'om 
30 B. thuringiensis was developed using the polymerase chain reaction. Flanking primers were 
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designed to anneal to conserved regions 5' to and within cryJ genes. With the exception of 

certain cry3 genes, most B. ihuringiensis cry genes are transcriptionally regulated, at least in part, 

by RNA polymerases containing the mother cell-specific or sigE, sigma factor. These o^- 

regulated cry genes possess 5' promoter sequences that are recognized by a^. Alignment of these 

5 promoter sequences reveals considerable sequence variation, although a consensus sequence can 

be identified (Baum and Malvar, 1995). A primer, designated "sigE", containing a sequence 

identical to the cryJAc promoter sequence, was designed that would anneal to related a 

promoter sequences 5' to uncharacterized cry genes. The sigE primer also includes a Bbul site 

(isoschizimer: Sphl) to facilitate cloning of amplified fragments.^ The sequence of the sigE 

10 primer is shown below: 

S^ATTTAGTAGCATGCGTTGCACTTTGTGCATTTTTTCATAAGATGA 

GTCATATGTTTTAAAT-3' (SEQ ID NO:55). 

The opposing primer, designated KpnR, anneals to a 3 '-proximal region of the cryl gene 
that is generally conserved. This primer incorporates an AspllS site (isoschizimer: Kpnl) 
15 conserved among the cryJA genes to facilitate cloning of the amplified fragment and to permit 
the construction of fusion proteins containing a carboxyl-terminal portion of the Cryl Ac protein. 
The sequence of the KpnR primer is shown below: 

5'-GGATAGCACTCATCAAAGGTACC-3' (SEQ ID NO:56) 

PCR™s were carried out using a Perkin Elmer DNA thermocycler and the following 
20 parameters: 94X, 2 min.; 40 cycles consisting of 94''C, 30 sec; 40°C, 2 min; 72°C, 3 min; and a 
10 second extension added to the ll^C incubation after 20 cycles. The standard PGR™ buffer 
(100 \xl volume) was modified to include IX Taq Extender buffer, 25 each of the sigE and 
KpnR primers, and 0.5 - 1.0 ^l of Taq Extender (Stratagene Inc.) in addition to 0.5 - 1.0 rI of 
Taq polymerase. Typically, 1-2 nl of the DNA preparations from novel B. thuringiensis isolates 
25 were included in the PCR™s. PCR™s with cry genes incorporating these primers resulted in the 
amplification of a ~2.3-kb DNA fragment flanked by restriction sites for Bbul and Aspl 1 8. 

For the cloning and expression of these gene fragments, the crylAc shuttle vector 
pEG1064 was used. This plasmid is derived from the cryl Ac shuttle vector pEG857 (Baum et 
al, 1990), with the following modifications. A frameshift mutation was generated at a unique 
30 Ncol site within the cryl Ac coding region by cleaving pEG857 with the restriction endonuclease 
Nco\, blunt-ending the JVcoI-generated ends with Klenow polymerase and ligating the blunt ends 
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with T4 ligase. In similar fashion, an Asp7]S site located in the multiple cloning site 3' to the 
crylAc gene was removed, leaving only the single Asp7\8 site contained within the cry] Ac 
coding sequence. The resulting plasmid, pEG1064, cannot direct the production of crystal 
protein when introduced into an acrystalliferous (Cry") strain of B. thuringiensis because of the 
5 frameshift mutation. For cloning and expression of unknown cry genes, pEG1064 was cleaved 
Bbul andAspin and the vector fragment purified following gel electrophoresis. Amplified 
fragments of unknown cry genes, obtained by PGR™ amplification of total B. thuringiensis 
DNA, were digested with the restriction endonucleases Bbul and AspllS and ligated into the 
Bbul andAspin sites of the pEG1064 vector fragment. The ligation mixture was used to 
10 transform the Cry' B. thuringiensis strains, EG 1 0368 or EG 1 0650. to chloramphenicol resistance 
using an electroporation protocol previously described (Mettus and Macaluso, 1990) 
Chloramphenicol-resistant (Cm*^) isolates were evaluated for crystal protein production by 
phase-contrast microscopy. Crystal forming (Ciy+) isolates were subsequently grown in C2 
liquid broth medium (Donovan et al., 1988) to obtain crystal protein for SDS-PAGE analysis and 
IS insect bioassay. 

Because of the frameshift mutation within the cry 1 Ac gene, the crystal proteins obtained 
from the transformants could not be derived from the vector pEG1064. The Cry* transformants 
thus contained unknown cry gene fragments fused, at the Aspl\% site, to a 3'-portion of the 
crylAc gene. Transcription of these gene fusions in B. thuringiensis was presumably directed 
20 from the promoter incorporated into the amplified cry gene fragment. The fusion proteins, 
containing the entire active toxin region of the unknown Cry protein, were capable of producing 
crystals in B. thuringiensis. 

6.7 Example 7 - Cloning of ciy9-Related Genes 

Total DNA was isolated from B. thuringiensis strain EG9290 for cloning studies. 

25 EG9290 was grown overnight at 30°C in IX brain heart infiasion, 0.5% glycerol (BHIG). In the 
morning, 500 of the overnight growth was suspended in 50 ml BHIG and the culture 
incubated at 30°C with agitation until the culture reached a Klett reading of 1 50 (red filter). The 
cells were harvested by centrifiigation, suspended in 5 ml IX GTE buffer containing 4 mg/ml 
lysozyme and 100 ^g/ml Rnase A, and incubated at 37°C for 20 min. The cells were lysed by 

30 the addition of 0.5 ml of 20% SDS. The released DNA was precipitated by the addition of 2.5 
ml 7.5 M ammonium acetate and 7 ml of isopropanol. The precipitated DNA was spooled out of 
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the mixture using a glass micropipette and washed in 80% ethanol. The DNA was resuspended 
in 10 ml IX TE, extracted with one volume each of buffered phenol and chlorofonn:isoamyl 
alcohol (24:1), and precipitated as before. The spooled DNA was washed in 80% ethanol, 
allowed to air dry for several min, and suspended in 600 ^1 IX TE. The DNA concentration was 

5 estimated at 500 ^g/ml. 

A library of EG9290 total DNA was constructed using partially digested Mbol fragments 
of EG9290 DNA and the general methods described herein. The partial Mbol fragments were 
inserted into the unique BamHl site of cloning vector pHT315. The ligation mixture was used to 
transform E, coli Sure™ cells to ampicillin resistance by electroporation employing 

10 electrocompetent cells and protocols provided by Stratagene (La Jolla, CA) and the BioRad Gene 
Pulser™ apparatus (Bio-Rad Laboratories, Hercules, CA). Recombinant clones harboring cry9- 
type genes were identified by colony blot hybridization using a ^^P-labeled probe consisting of 
the putative cry9C fragment generated by amplification of EG9290 DNA with primers pr58 and 
pr59. Plasmid DNAs were extracted from the E. coli clones using a standard alkaline lysis 

IS procedure. 

Plasmid DNAs from the E. coli recombinant clones were used to transform 
B. ihuringiensis strain EG 10368 to erythromycin resistance using the electroporation procedure 
described by Mettus and Macaluso (1990). Cells were plated onto starch agar plates containing 
20 ^g/ml erythromycin and incubated at 30°C. After six days, colonies with a more opaque 
20 appearance were recovered from the plates and streaked out onto fresh starch agar plates 
containing 20 |ag/ml erythromycin to isolate single colonies. Colonies exhibiting a more opaque 
appearance were observed to produce large parasporal inclusions/crystals by phase-contrast 
microscopy. 

Recombinant EG 10368 clones producing parasporal inclusion/crystals were evaluated for 
25 crystal protein production in broth culture. Single colonies were inoculated into C2 medium 
containing 10 fig/ml erythromycin and grown at 30''C for 3 days at 28-30''C, at which time the 
cultures were fully sporulated and lysed. Spores and crystals were pelleted by centrifugation and 
resuspended in 20 mM Tris-HCl, 1 mM EDTA, pH 7.0. Aliquots of this material were analyzed 
by SDS-polyacrylamide gel electrophoresis (SDS-PAGE). Two EG10368 recombinant clones, 
30 initially identified as 9290-2 and 9290-3, were observed to produce distinct proteins of -130 
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kDa. 9290-2 was designated EG12102 and 9290-3 was designated EG12103. The EG12102 
protein was designated CryET59 while the EG 1 2 1 03 protein was designated CryET60. 

Plasmid DNAs were prepared from EG12102 and EG12103 using a standard alkaline 
lysis procedure. Digestion of the plasmids with the restriction endonuclease Xba\ confirmed that 
the two strains harbored distinct cry genes. The cry plasmids of EG12102 and EG12103, 
designated pEG945 and pEG946, respectively, were used to transform E. coli Sure™ ceils to 
ampicillin resistance by electroporation, employing electrocompetent cells and protocols 
provided by Stratagene Inc. The E. coli recombinant strain containing pEG945 was designated 
EG12132, and the E. coli recombinant strain containing pEG946 was designated EG12133. 
pEG945 and pEG946 were purified from the E. coli recombinant strains using the QIAGEN 
midi-column plasmid purification kit and protocols (QIAGEN Inc., Valencia, CA). 

The cryET83 gene was cloned from B. thuringiensis strain EG6346 subspecies aizawai 
using similar methods. Southern blot analysis of genomic DNA from EG6346 revealed a unique 
restriction fragment that hybridized to the cryET59 probe. A series of degenerate 
oligonucleotide primers, pr95, pr97, and pr98, were designed to amplify cryf-related sequences 
from genomic DNA. The sequences of these primers are as shown: 

pr95: 5'- GTWTGGACSCRTCGHGATGTGG -3' (SEQ ID NO:57) 

pr97: 5'- TAATTTCTGCTAGCCCWATTTCTGGATTTAATTGTTGATC -3' 

(SEQ ID NO:58) 

pr98: 5'- ATWACNCAAMTWCCDTTRG -3' (SEQIDNO:59) 

whereD = A,G;H = A, C, T; M =A, C; N = A, C, G, T; R= A, G; S = C, G;andW = A,T. 

A PCR^" using Taq polymerase, Taq Extender™ (Stratagene, La Jolla, CA), the 
opposing primers pr95 and pr97, and total EG6346 DNA yielded a DNA fragment that was 
faintly visible on an ethidium bromide-stained agarose gel. This DNA served as the template for 
a second round of PGR™ using the opposing primers pr97 and pr98. The resulting amplified 
DNA fragment was suitable for cloning and served as a hybridization probe for subsequent 
cloning experiments. A library of EG6346 total DNA was constructed using partially digested 
9-12 kb Mbol fragments of EG6346 DNA ligated into the unique BamUl site of cloning vector 
pHT315. E. coli recombinant clones harboring the cryET83 gene were identified by colony blot 
hybridization using the EG6346-specific DNA fragment as a chemiluminescent hybridization 
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probe and the CDP-Star'"'^ nucleic acid chemiluminescent reagent kit from NEN™ Life Science 
Products (Boston, MA) to prepare the hybridization probe. The recombinant piasmid harboring 
the cryET83 gene was designated pEG397. The E. coli recombinant stain containing pEG397 
was designated EG 11786. The B. thuringiensis recombinant strain containing pEG397 was 
s designated EG 11785. 

6.8 Example 8 - Sequencing of Cloned B. thuringiensis Toxin Genes 

Partial sequences for the cloned toxin genes were determined following established 
dideoxy chain-termination DNA sequencing procedures (Sanger et al, 1977). Preparation of the 
double stranded piasmid template DNA was accomplished using a standard alkaline lysis 

10 procedure or using a QIAGEN piasmid purification kit (QIAGEN Inc., Valencia, CA). The 
sequencing reactions were performed using the Sequenase™ Version 2.0 DNA Sequencing Kit 
(United States Biochemical/Amersham Life Science Inc., Cleveland, OH) following the 
manufacturer's procedures and using •'^S-dATP as the labeling isotope (obtained from DuPont 
NEN® Research Products, Boston, MA). Denaturing gel electrophoresis of the reactions is done 

IS on a 6% (wt./vol.) acrylamide, 42% (wt./vol.) urea sequencing gel. The dried gels are exposed to 
Kodak X-OMAT AR X-ray film (Eastman Kodak Company, Rochester, NY) overnight at room 
temperature. Alternatively, some cry genes were sequenced using automated sequencing 
methods. DNA samples were sequenced using the ABI PRISM™ DyeDeoxy sequencing 
chemistry kit (Applied Biosystems, Foster City, CA) according to the manufacturer's suggested 

20 protocol. The completed reactions were run on as ABI 377 automated DNA sequencer. DNA 
sequence data were analyzed using Sequencher™ v3.0 DNA analysis software (Gene Codes 
Corp., Ann Arbor, MI). Successive oligonucleotides to be used for priming sequencing reactions 
were designed from the sequencing data of the previous set of reactions. 

The sequence determination for the crv/-related genes involved the use of the 

25 oligonucleotide probe wd207, described in Example 2, as the initial sequencing primer. This 
oligonucleotide anneals to a conserved region of cryl genes, but because of the inverted and 
reversed orientation of wd207, it generates sequence towards the 5 '-end of the coding region 
allowing sequence of the variable region of the gene to be read. A typical sequencing run of 
250-300 nucleotides was usually sufficient to determine the identity of the gene. If additional 

30 data were necessary, one or more additional oligonucleotides could be synthesized to continue 
the sequence until it could be determined if the sequence was unique. In cases where wd207 did 
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not function well as a primer, other oligonucleotides, designed to anneal to conserved regions of 
cryl genes, were used. One such oligonucleotide was the KpnR primer described herein above. 

The sequencing of the cloned cry2-related genes followed the same general procedures as 
those described for the cryl genes, except that oligonucleotides specific for conserved regions in 
5 cry2 genes were used as sequencing primers. The two primers used in these examples were 
wd268 and wd269, shown below. 

Primer wd268 corresponds to crylAa nucleotides 579-597 
5'-AATGCAGATGAATGGGG-3' (SEQ ID NO:60). 
10 Primer wd269 corresponds to cry2/4a 1740-1757 

5'-TGATAATGGAGCTCGTT-3' (SEQ ID NO:61) 

The sequencing of cryET59 and cryET60 commenced with the use of primer pr56. The 
sequencing of cry ET8 3 commenced with the use of primer pr98. Successive oligonucleotides to 
15 be used for priming sequencing reactions were designed from the sequencing data of the 
previous set of reactions. 

The derived sequences were compared to sequences of known cry genes using the 
FSTNSCAN program in the PC/GENE sequence analysis package (Intelligenetics, Mountain 
View, CA). This analysis permitted a preliminary classification of the cloned cry genes with 
20 respect to previously-known cry genes (Table 1 1). 
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TABLE 6 - HOMOLOGY COMPARISON OF DNA SEQUENCES 



Cloned Gene 



DNA Sequence Identity 



cryETSJ 
cryET40 
cryET43 
cryET44 
cryET45 
cryET46 
cryET47 
cryET49 
cryETSI 
cryET52 
cryET53 
cryET54 
cryET56 
cryET57 
cryET59 
cryETSO 
cryET6l 
cryET62 
cryET63 
cryET64 
cryET66 
cryET67 
cryET72 
cryET73 
cryET83 



90% identity with SEQ ID N0:4 of WO 98/40490 
99% identity with crylAa 
88% identity with cryJBdl 
90% identity with crylDa/lDb 
91% identity with ay I Da/I Db 
98% identity with crylGa 
99% identity with cry I Ah 
95% identity with cryIJa 
85% identity with cry 1 Ac 
84% identity with crylDa/lDb 
99% identity with SEQ ID NO:8 of US 5,723,758 
99.8% identity with cry] Be 
80% identity with cry I Ac 
98% identity with cry I Da 
95% identity with cry9Ca 
99.6% identity with cry9Aa 
97% identity with crylHa 
99% identity with crylAd 
93% identity with crylAc 
91% identity with SEQ ID NO:9 of US 5,723,758 

76% identity with crylGa 
99% identity with SEQ ID NO:10 of US 5,723,758 
98% identity with SEQ ID NO:4 of WO 98/40490 
99% identity with SEQ ID NO:6 of WO 98/40490 



' Ktup value set at 2 for FSTNSCAN. The cryET59 and cryETOO sequences were 
compared using the FASTA program (Ktup=6) in the PC/GENE sequence analysis package. 
5 6.9 Example 9 ~ Expression of Cloned Toxin Genes in a B. thuringiensis Host 

Plasmid DNA was isolated from E. coli colonies identified by hybridization to a gene- 
specific probe. The isolated plasmid was then introduced into a crystal protein-negative (Cry-) 
strain of B. thuringiensis using the electroporation protocol of Mettus and Macaluso (1990). 
Each of the cloning vectors used (see Table 5) has a gene to confer antibiotic resistance on the 

10 cells harboring that plasmid. B. thuringiensis transformants were selected by growth on agar 
plates containing 25 mg/ml erythromycin (pHT315) or 5 mg/ml chloramphenicol (pEG597 and 
pEG1064). Antibiotic-resistant colonies were then evaluated for crystal protein production by 
phase-contrast microscropy. Crystal producing colonies were then grown in C2 medium 
(Donovan et al., 1988) to obtain cultures which were analyzed by SDS-PAGE and insect 

15 bioassay. 

C2 cultures were inoculated with cells from Cry"^ colonies and grown for three days at 
25-30''C in the presence of the appropriate antibiotic. During this time the culture grew to 
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stationary phase, sporulated £ind lysed, releasing the protein inclusions into the medium. The 
cultures are harvested by centrifiigation, which pellets the spores and crystals. The pellets were 
washed in a solution of 0.005% Triton X-100®, 2 mM EDTA and centriftiged again. The 
washed pellets were resuspended at one-tenth the original volume in 0.005% Triton X-100®, 2 
5 mM EDTA. 

Crystal protein were solubilized from the spores-crystal suspension by incubating the 
suspension in a solubilization buffer [0.14 M Tris-HCl pH 8.0, 2% (wt./vol.) sodium dodecyl 
sulfate (SDS), 5% (vol./vol.) 2-mercaptoethanol, 10% (vol./vol.) glycerol, and 0.1% bromphenol 
blue] at 1 00°C for 5 min. The solubilized crystal proteins were size-fractionated by SDS-PAGE 
10 using a gel with an acrylamide concentration of 10%. After size fractionation the proteins were 
visualized by staining with Coomassie Brilliant Blue R-250. 

The expected size for Cryl- and Cry9-related crystal proteins was approximately 130 
kDa. The expected size for Cry2-related proteins was approximately 65 kDa. 

6.10 Example 10 ~ Insecticidal Activity of the Cloned B. ihuringiensis Toxin Genes 

15 B. thuringiensis recombinant strains producing individual cloned cry genes were grown 

in C2 medium imtil the cultures were fully sporulated and lysed. These C2 cultures were used to 
evaluate the insecticidal activity of the crystal proteins produced. Each culture was diluted with 
0.005% Triton® X-100 to achieve the appropriate dilution for two-dose bioassay screens. Fifty 
microliters of each dilution were topically applied to 32 wells containing 1 .0 ml artificial diet per 

20 well (surface area of 175 mm^). A single lepidopteran larvae was placed in each of the treated 
wells and the tray was covered by a clear perforated mylar sheet. With the exception of the P. 
xylostella bioassays, that employed 3rd instar larvae, all the bioassays were performed with 
neonate larvae. Larval mortality was scored after 7 days of feeding at 28-30 °C and percent 
mortality was expressed as ratio of the number of dead larvae to the total number of larvae 

25 treated (Table 12). In some instances, severe stunting of larval growth was observed after 7 
days, and the ratio of stunted/unstunted larva was also recorded. The bioassay results shown in 
Table 7 demonstrate that the crystal proteins produced by the recombinant B. thuringiensis 
strains do exhibit insecticidal activity and, fiirthermore. 
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Table 7A. Bioassay evaluations with ET crystal proteins 



Spodoptera exigua Spodoptera frugiperda 
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% mortality 


zjuuni/weii 
% mortality 
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Table 7B. Bioassay evaluations with ET crystal proteins 



PluteUa xyhstelia Ostrinia nubilalis 
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# stunted 
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# stunted 




% mortality 


% mortality 
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% mortality 
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0 
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0 


ET73 


2 


2 


0 


93 
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2 


2 


0 


0 


0 


0 
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Table 7C. Bioassay evaluations with ET crystal proteins 



Heliothis virescens Helicoverpa zea 
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Table 7D. Bioassay evaluations with ET crystal proteins 



Agrolis ipsilon Trichoplusia ni 
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that the crystal proteins exhibit differential activity towards the lepidopteran species tested. 

5 Additional bioassays were performed with the crystal proteins designated CryET59, 

CryET60, CryET66, and CryET83. Crystal proteins produced in C2 medium were quantified 
by SDS-PAGE and densitometry using the method described by Brussock, S. M. and Currier, T. 
C, 1990, "Use of Sodium Dodecyl Sulfate-Polyacrylamide Gel Electrophoresis to Quantify 
Bacillus thuringiensis 8-Endotoxins", in Analytical Chemistry of Bacillus thuringiensis (L. A. 

10 Hickle and W. L. Fitch, eds.). The American Chemical Society, pp. 78-87. 



wo 01/19859 



PCT/USOO/25361 



74- 



10 



IS 



20 



Table 8 - Bioassay Evaluation of CryET59 and CryET60 
Percent mortality 



Toxin 



Control' 

CryET59 

CryET59 

CryET59 

CryET60 

CryET60 



Dose 
ng/well 


AI 


HV 


HZ 


ON 


PX 


rPX 


SE 


TN 




2 


6 


0 


0 


2 


0 


2 


0 


100 


2 


37 


0 


94 


100 


100 


2 


13 


500 


11 


80 


3 


100 


100 


100 


0 


63 


5000 


62 


100 


6 


100 


100 


100 


71 


100 


500 


0 


93 


22 


100 


100 


100 


0 


100 


5000 


2 


100 


25 


100 


100 


100 


14 


100 



mbilalis. PX = Plutella xylostella. rPX = Plutella xylostella colony resistant to Cry lA and 
Cry IF toxins, SE = Spodoptera et/gaa. TN = Trichoplusiani. 
'Control = no toxin added. 

I 

The procedure was modified to eliminate the neutralization step with 3M HEPES. 
Crystal proteins resolved by SDS-PAGE were quantified by densitometry using a Molecular 
5 Dynamics model 300A computing densitometer and purified bovine serum albumin (Pierce, 
Rockford, IL) as a standard. 

The bioassay results shown in Table 8 demonstrate that CryET59 and CryET60 are toxic 
to a number of lepidopteran species, including a colony of P. xylostella that is resistant to Cry! A 
and Cry IF crystal proteins. Eight-dose assays with CryET66 also demonstrated excellent 
toxicity towards both the susceptible and resistant colonies of P. xylostella (Table 14). In this 
instance, eight crystal protein concentrations were prepared by serial dilution of the crystal 
protein suspensions in 0.005% Triton® X-100 and 50 ul of each concentration was topically 
applied to wells containing 1 .0 ml of artificial diet. After the wells had dried, a single larvae was 
placed in each of the treated wells and the tray was covered by a clear perforated mylar sheet (32 
larvae for each crystal protein concentration). Larval mortality was scored after 7 days of 
feeding at 28-30 °C. Mortality data was expressed as LC50 and LC9S values, the concentration of 
crystal protein (ng/175 mm^ diet well) causing 50% and 95% mortality, respectively (Daum, 
1970). 



Toxin 


LCso' 


95% C.I. 


LC9S'' 


Slope 


» Cry] Ac 
CrylC 
CryET66 


8.05 

25.06 

0.42 


5.0-15.2 

15.7-40.6 
0.4-0.5 


52.94 

117.07 

1.4 


2.01 
2.46 
3.13 
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Toxicity of CryET66 towards Cryl A-resistant Plutella xylostella 



Toxin 


LC50' 95% C.l. 


LC95' 


Slope 


CrylAc 


*No significant mortality 
> 






CrylC 


27.32 15.4-51.1 


156.13 


2.17 


CryET66 


1.65 1.3-2.0 


6.41 


2.79 



' the concentration of crystal protein, in nanograms of crystal protein per well, required to achieve 50% mortality 

5 

^ the concentration of crystal protein, in nanograms of crystal protein per well, required to achieve 95% mortality. 

Table 15 shows that the CryET83 protein exhibits toxicity towards a wide variety of 
10 lepidopteran pests and may exhibit improved toxicity towards S. exigua and H. virescens when 
compared to the other Cry9-type proteins CryET59 and CryET60. 



Table 10 - Toxicity of CryET83 towards lepidopteran larvae 


Dose-" 
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IS 

' Toxicity calculated as percent mortality among treated larvae. 

^ ng CryET83 crystal protein/ 1 75 mm^ diet well ^ 
^ Abbreviations described in Table 8; SF = Spodoptera frugiperda * ) 

20 

The recombinant B. thuringiensis strains listed in Table 5 were deposited with the ARS Patent Culture Collection 
and had been assigned the NRRL deposit numbers shown in Table 1 1 . 

Table 11. Biological Deposits 

25 



Polypeptide 


Polypeptide 


Polynucleotide 


Recomb. 


NRRL Deposit 


Designation 


Seq. ID No.: 


Seq ID No.: 


Strain 


No.: 


Cry ET31 


2 


1 


EG11562 


B-21921 


Cry ET40 


4 


3 


EG 11 901 


B-21922 


Cry ET43 


6 


5 


EG7692 


B-21923 


Cry ET44 


8 


7 


EG11629 


B-21924 


Cry ET45 


10 


9 


EG76d4 


B-21925 


Cry ET46 


12 


11 


EG7695 


B-21926 


Cry ET47 


14 


13 


EG7696 


B-21927 


Cry ET49 


16 


15 


EG11630 


B-21928 


Cry ET51 


18 


17 


EG11921 


B-21929 


Cry ET52 


20 


19 


EG 11 684 


B-21930 


Cry ET53 


22 


21 


EG11906 


B-21931 
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polypeptide 
Designation 



Polypeptide 
Seq. ID No.: 



Polynucleotide 
Seq ID No.; 



Recomb. 
Strain 



NRRL Deposit 
No.: 



Cry ET54 
Cry ET56 
Cry ET57 
Cry ET59 
Cry ET60 
Cry ET61 
Cry ET62 
Cry ET63 
Cry ET64 
Cry ET66 
Cry ET67 
Cry ET68 
Cry ET72 
CryET73 
CryET83 



63 
24 
26 
28 
30 
32 
34 
36 
38 
40 
42 
44 
46 
48 
50 



62 
23 
25 
27 
29 
31 
33 
35 
37 
39 
41 
■ 43 
45 
47 
49 



EG11907 
EG 11 909 
EG11910 
EG12102 
EG12103 
EG11634 
EG11635 
EG11636 
EG11638 
EG11640 
EG11642 
EG11644 
EG11440 
EG11465 
EG11785 



B-21932 
B-21933 
B-21934 
B-21935 
B-21936 
B-21937 
B-21938 
B-21939 
B-21940 
B-21941 
B-21942 
B-30137 
B-21943 
B-21944 
B-30138 



6.11 Example 11 - Modification of cry Genes for Expression in Plants 

Wild-type cry genes are known to be expressed poorly in plants as a full length gene or as 
a truncated gene. Typically, the G+C content of a cry gene is low (37%) and often contains 
many A+T rich regions, potential polyadenylation sites and numerous ATTTA sequences. Table 
12 shows a list of potential polyadenylation sequences which should be avoided when preparing 
the "plantized" gene construct. 
Table 12 - LIST OF SEQUENCES OF POTENTIAL POLYADENYLATION SIGNALS 



AATAAA* 
AATAAT* 
AACCAA 
ATATAA 
AATCAA 
ATACTA 
ATAAAA 
ATGAAA 



AAGCAT 
ATTAAT 
ATACAT 
AAAATA 
ATTAAA** 
AATTAA** 
AATACA** 
CATAAA** 



* indicates a potential major plant polyadenylation site. 

indicates a potential minor animal polyadenylation site. 
All othere are potential minor plant polyadenylation sites. 
The regions for mutagenesis may be selected in the following manner. All regions of the 
DNA sequence of the cry gene are identified which contained five or more consecutive base 
pairs which were A or T. These were ranked in terms of length and highest percentage of A+T in 
the surrounding sequence over a 20-30 base pair region. The DNA is analysed for regions which 
might contain polyadenylation sites or ATTTA sequences. Oligonucleotides are then designed 
which maximize the elimination of A+T consecutive regions which contained one or more 
polyadenylation sites or ATTTA sequences. Two potential plant polyadenylation sites have been 
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shown to be more critical based on published reports. Codons are selected which increase G+C 
content, but do not generate restriction sites for enzymes useful for cloning and assembly of the 
modified gene {e.g., BamHl, Bglll, Sad, Ncol, EcoKV, etc.). Likewise condons aire avoided 
which contain the doublets TA or GC which have been reported to be infrequently-found codons 
s in plants. 

Although the CaMV35S promoter is generally a high level constitutive promoter in most 
plant tissues, the expression level of genes driven the CaMV35S promoter is low in floral tissue 
relative to the levels seen in leaf tissue. Because the economically important targets damaged by 
some insects are the floral parts or derived from floral parts (e.^., cotton squares and bolls, 

10 tobacco buds, tomato buds and fruit), it is often advantageous to increase the expression of 
crystal proteins in these tissues over that obtained with the CaMV35S promoter. 

The 35S promoter of Figwort Mosaic Virus (FMV) is analogous to the CaMV35S 
promoter. This promoter has been isolated and engineered into a plant transformation vector. 
Relative to the CaMV promoter, the FMV 35S promoter is highly expressed in the floral tissue, 

IS while still providing similar high levels of gene expression in other tissues such as leaf. A plant 
transformation vector, may be constructed in which the full length synthetic cry gene is driven by 
the FMV 3SS promoter. Tobacco plants may be transformed with the vector and compared for 
expression of the crystal protein by Western blot or ELISA immunoassay in leaf and floral 
tissue. The FMV promoter has .been used to produce relatively high levels of crystal protein in 

20 floral tissue compared to the CaMV promoter. 

6.12 Example 12 — Expression of Synthetic cry Genes with ssRUBISCO Promoters and 
Chloroplast Transit Peptides 

The genes in plants encoding the small subunit of RUBISCO (SSU) are often highly 
expressed, light regulated and sometimes show tissue specificity. These expression properties 

25 are largely due to the promoter sequences of these genes. It has been possible to use SSU 
promoters to express heterologous genes in transformed plants. Typically a plant will contain 
multiple SSU genes, and the expression levels and tissue specificity of different SSU genes will 
be different. The SSU proteins are encoded in the nucleus and synthesized in the cytoplasm as 
precursors that contain an N-terminal extension known as the chloroplast transit peptide (CTP). 

30 The CTP directs the precursor to the chloroplast and promotes the uptake of the SSU protein into 
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the chloroplast. In this process, the CTP is cleaved from the SSU protein. These CTP sequences 
have been used to direct heterologous proteins into chloroplasts of transformed plants. 

The SSU promoters might have several advantages for expression of heterologous genes 
in plants. Some SSU promoters are very highly expressed and could give rise to expression 

5 levels as high or higher than those observed with the CaMV35S promoter. The tissue 
distribution of expression from SSU promoters is different from that of the CaMV35S promoter, 
so for control of some insect pests, it may be advantageous to direct the expression of crystal 
proteins to those cells in which SSU is most highly expressed. For example, although relatively 
constitutive, in the leaf the CaMV35S promoter is more highly expressed in vascular tissue than 

,0 in some other parts of the leaf, while most SSU promoters are most highly expressed in the 
mesophyll cells of the leaf. Some SSU promoters also are more highly tissue specific, so it could 
be possible to utilize a specific SSU promoter to express the protein of the present invention in 
only a subset of plant tissues, if for example expression of such a protein in certain cells was 
found to be deleterious to those cells. For example, for control of Colorado potato beetle in 

,5 potato, it may be advantageous to use SSU promoters to direct crystal protein expression to the 
leaves but not to the edible mbers. 

Utilizing SSU CTP sequences to localize crystal proteins to the chloroplast might also be 
advantageous. Localization of the B. thuringiensis crystal proteins to the chloroplast could 
protect these from proteases found in the cytoplasm. This could stabilize the proteins and lead to 

20 higher levels of accumulation of active toxin, cry genes containing the CTP may be used in 
combination with the SSU promoter or with other promoters such as CaMV35S. 
6.13 Example 13 -- Targeting of Cry Proteins to the Extracellular Space or Vacuole 
through the Use of Signal Peptides 
The B. thuringiensis proteins produced from the synthetic genes described here are 

25 localized to the cytoplasm of the plant cell, and this cytoplasmic localization results in plants that 
are insecticidally effective. It may be advantageous for some purposes to direct the 
B. thuringiensis proteins to other compartments of the plant cell. Localizing B. thuringiensis 
proteins in compartments other than the cytoplasm may result in less exposure of the 
5. thuringiensis proteins to cytoplasmic proteases leading to greater accumulation of the protein 

30 yielding enhanced insecticidal activity. Extracellular localization could lead to more efficient 
exposure of certain insects to the B. thuringiensis proteins leading to greater efficacy. If a 
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B. ihuringiensis protein were found to be deleterious to plant cell function, then localization to a 
noncytoplasmic compartment could protect these cells from the protein. 

In plants as well as other eukaryotes, proteins that are destined to be localized either 
extracellularly or in several specific compartments are typically synthesized with an N-terminal 
5 amino acid extension known as the signal peptide. This signal peptide directs the protein to enter 
the compartmentalization pathway, and it is typically cleaved from the mature protein as an early 
step in compartmentalization. For an extracellular protein, the secretory pathway typically 
involves cotranslational insertion into the endoplasmic reticulum with cleavage of the signal 
peptide occurring at this stage. The mature protein then passes through the Golgi body into 

10 vesicles that fuse with the plasma membrane thus releasing the protein into the extracellular 
space. Proteins destined for other compartments follow a similar pathway. For example, 
proteins that are destined for the endoplasmic reticulum or the Golgi body follow this scheme, 
but they are specifically retained in the appropriate compartment. In plants, some proteins are 
also targeted to the vacuole, another membrane botrnd compartment in the cytoplasm of many 

15 plant cells. Vacuole targeted proteins diverge from the above pathway at the Golgi body where 
they enter vesicles that fuse with the vacuole. 

A common feature of this protein targeting is the signal peptide that initiates the 
compartmentalization process. Fusing a signal peptide to a protein will in many cases lead to the 
targeting of that protein to the endoplasmic reticulum. The efficiency of this step may depend on 

20 the sequence of the mature protein itself as well. The signds that direct a protein to a specific 
compartment rather than to the extracellular space are not as clearly defined. It appears that 
many of the signals that direct the protein to specific compartments are contained within the 
amino acid sequence of the mature protein. This has been shown for some vacuole targeted 
proteins, but it is not yet possible to define these sequences precisely. It appears that secretion 

25 into the extracellular space is the "default" pathway for a protein that contains a signal sequence 
but no other compartmentalization signals. Thus, a strategy to direct B. thuringiensis proteins out 
of the cytoplasm is to fuse the genes fqj- synthetic B. thuringiensis genes to DNA sequences 
encoding known plant signal peptides. These fusion genes v^U give rise to B. thuringiensis 
proteins that enter the secretory pathway, and lead to extracellular secretion or targeting to the 

30 vacuole or other compartments. 
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Signal sequences for several plant genes have been described. One such sequence ii for 
the tobacco pathogenesis related protein PRlb has been previously described (Comelissen et ai, 
1986). The PRlb protein is normally localized to the extracellular space. Another type of signal 
peptide is contained on seed storage proteins of legumes. These proteins are localized to the 

5 protein body of seeds, which is a vacuole like compartment found in seeds. A signal peptide 
DNA sequence for the p-subunit of the 7S storage protein of common bean (Phaseolus vulgaris), 
PvuB has been described (Doyle et al, 1986). Based on the published these published 
sequences, genes may be synthesized chemically using oligonucleotides that encode the signal 
peptides for PRlb and PvuB. In some cases to achieve secretion or compartmentalization of 

,0 heterologous proteins, it may be necessary to include some amino acid sequence beyond the 
normal cleavage site of the signal peptide. This may be necessary to insure proper cleavage of 
the signal peptide. 

6.14 Example 14 ~ Isolation of Transgenic Plants Resistant to Insects Using cry 
Transgenes 

15 6.64.1 Plant Gene Construction 

The expression of a plant gene which exists in double-stranded DNA form involves 
transcription of messenger RNA (mRNA) from one strand of the DNA by RNA polymerase 
enzyme, and the subsequent processing of the mRNA primary transcript inside the nucleus. This 
processing involves a 3' non-translated region which adds polyadenylate nucleotides to the 3' 

20 end of the RNA. Transcription of DNA into mRNA is regulated by a region of DNA usually 
referred to as the "promoter". The promoter region contains a sequence of bases that signals 
RNA polymerase to associate with the DNA and to initiate the transcription of mRNA using one 
of the DNA strands as a template to make a corresponding strand of RNA. 

A number of promoters which are active in plant cells have been described in the 

25 literature. Such promoters may be obtained from plants or plant viruses and include, but are not 
limited to, the nopaline synthase (NOS) and octopine synthase (OCS) promoters (which are 
carried on tumor-inducing plasmids of Agrobacierium tumefaciens), the cauliflower mosaic virus 
(CaMV) 19S and 35S promoters, the light-inducible promoter from the small subunit of ribulose 
1,5-bisphosphate carboxylase (ssRUBISCO, a very abundant plant polypeptide), and the Figwort 

30 Mosaic Virus (FMV) 35S promoter. All of these promoters have been used to create various 
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types of DNA constructs which have been expressed in plants (see e.g., U. S. Patent No. 
5,463, 1 75, specifically incorporated herein by reference). 

The particular promoter selected should be capable of causing sufficient expression of the 
enzyme coding sequence to result in the production of an effective amount of protein. One set 

5 of preferred promoters are constitutive promoters such as the CaMV35S or FMV35S promoters 
that yield high levels of expression in most plant organs (U. S. Patent No. 5,378,619, specifically 
incorporated herein by reference). Another set of preferred promoters are root enhanced or 
specific promoters such as the CaMV derived 4 as-1 promoter or the wheat POXl promoter (U. 
S. Patent No. 5,023,179, specifically incorporated herein by reference; Hertig al., 1991). The 

10 root enhanced or specific promoters would be particularly preferred for the control of com 
rootworm {Diabroticus spp.) in transgenic com plants. 

The promoters used in the DNA constructs {i.e. chimeric plant genes) of the present 
invention may be modified, if desired, to affect their control characteristics. For example, the 
CaMV35S promoter may be ligated to the portion of the ssRUBISCO gene that represses the 

15 expression of ssRUBISCO in the absence of light, to create a promoter which is active in leaves 
but not in roots. The resulting chimeric promoter may be used as described herein. For purposes 
of this description, the phrase "CaMV35S" promoter thus includes variations of CaMV35S 
promoter, e.g., promoters derived by means of ligation v^th operator regions, random or 
controlled mutagenesis, etc. Furthermore, the promoters may be altered to contain multiple 

20 "enhancer sequences" to assist in elevating gene expression. 

The RNA produced by a DNA construct of the present invention also contains a 5' non- 
translated leader sequence. This sequence can be derived from the promoter selected to express 
the gene, and can be specifically modified so as to increase translation of the mRNA. The 5' 
non-translated regions can also be obtained from viral RNA's, from suitable eucaryotic genes, or 

25 from a synthetic gene sequence. The present invention is not limited to constructs wherein the 
non-translated region is derived fi-om the 5' non-translated sequence that accompanies the 
promoter sequence. 

For optimized expression in monocotyledenous plants such as maize, an intron should 
also be included in the DNA expression construct. This intron would typically be placed near 
30 the 5' end of the mRNA in imtranslated sequence. This intron could be obtained fi-om, but not 
limited to, a set of introns consisting of the maize hsp70 intron (U. S. Patent No. 5,424,412; 



BNSOOCID: <WO_01 19859AZJ_> 



wo 01/19859 



PCT/USOO/25361 



-82- 

specifically incorporated herein by reference) or the rice Act! intron (McElroy et al, 1990). As 
shown below, the maize hsp70 intron is useful in the present invention. 

As noted above, the 3' non-translated region of the chimeric plant genes of the present 
invention contains a polyadenylation signal which functions in plants to cause the addition of 
5 adenylate nucleotides to the 3' end of the RNA. Examples of preferred 3' regions are (1) the 3' 
transcribed, non-translated regions containing the polyadenylate signal of Agrobacterium tumor- 
inducing (Ti) plasmid genes, such as the nopaline synthase (NOS) gene and (2) plant genes such 
as the pea ssRUBISCO E9 gene (FischhofF et al., 1987). 
6.14.2 Plant Transformation and Expression , 
,0 A plant gene containing a structural coding sequence of the present invention can be 

inserted into the genome of a plant by any suitable method. Suitable plant transformation vectors 
include those derived from a Ti plasmid of Agrobacterium lumefaciens, as well as those 
disclosed, e.g., by Herrera-Estrella (1983), Bevan (1983). Klee (1985) and Eur. Pat. Appl. Publ. 
No. EP0120516. In addition to plant transformation vectors derived from the Ti or root-inducing 
,5 (Ri) plasmids of Agrobacterium, alternative methods can be used to insert the DNA constructs of 
this invention into plant cells. Such methods may involve, for example, the use of liposomes, 
electroporation, chemicals that increase free DNA uptake, free DNA delivery via microprojectile 
bombardment, and transformation using viruses or pollen (Fromm et al, 1986; Armstrong et ai, 
1 990; Fromm et al. , 1 990). 
20 6.14.3 Construction of Monocot Plant Expression Vectors for cry Genes 

For efficient expression of cry genes in transgenic plants, the gene must have a suitable 
sequence composition (Diehn et ai, 1996). To place the cry gene in a vector suitable for 
expression in monocotyledonous plants {i.e. under control of the enhanced Cauliflower Mosaic 
Virus 35S promoter and link to the hsp70 intron followed by a nopaline synthase 
25 polyadenylation site as in U. S. Patent No. 5.424.412, specifically incorporated herein by 
reference), a vector such as pMON19469 may be used. Such a vector is conveniently digested 
with Ncol and Eco^ restriction enzymes. The larger vector band of approximately 4.6 kb is 
then electrophoresed, purified, and ligated with T4 DNA ligase to an Nco\-Ecom fragment 
which contains the synthetic cry gene. The ligation mix is then transformed into E. coli, 
30 carbenicillin resistant colonies recovered and plasmid DNA recovered by DNA miniprep 
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procedures. The DNA is then subjected to restriction endonuclease analysis with enzymes such 
as Ncol and EcoKi (together), Notl, and/or Psil individually or in combination, to identify clones 
containing the cry coding sequence fiised to an intron such as the hsp70 intron, placed under the 
control of the enhanced CaMV35S promoter. 
5 To place the gene in a vector suitable for recovery of stably transformed and insect 

resistant plants, the 3.75-kb Notl restriction fragment from pMON33708 containing the lysine 
oxidase coding sequence fused to the hsp70 intron under control of the enhanced CaMV35S 
promoter may be isolated by gel electrophoresis and purification. This fragment is then ligated 
with a vector such as pMON30460 which has been previously treated with Notl and calf 

10 intestinal alkaline phosphatase (pMON30460 contains the neomycin phosphotransferase coding 
sequence under control of the CaMV35S promoter). Kanamycin resistant colonies may then be 
obtained by transformation of this ligation mix into £1 coli and colonies containing the desired 
plasmid may be identified by restriction endonuclease digestion of plasmid miniprep DNAs. 
Restriction enzymes such as Notl, EcoKV, Hindlll, Ncol, EcoRI, and BgUl may be used to 

15 identify the appropriate clones in which the orientation of both genes are in tandem (i.e. the 3' 
end of the cry expression cassette is linked to the 5' end of the nptll expression cassette). 
Expression of the Cry protein by the resulting plasmid in com protoplasts may be confirmed by 
electroporation of the vector DNA into protoplasts followed by protein blot and ELISA analysis. 
This vector may be introduced into the genomic DNA of com embryos by particle gun 

20 bombardment followed by paromomycin selection to obtain com plants expressing the cry gene f 
essentially as described in U. S. Patent No. 5,424,412, specifically incorporated herein by 
reference. 

As an example, the vector may be introduced via cobombardment with a hygromycin 
resistance conferring plasmid into immature embryo scutella (lES) of maize, followed by 
25 hygromycin selection, and regeneration. Transgenic com lines expressing the cry protein may 
then be identified by ELISA analysis. Progeny seed from these events may then be subsequently 
tested for protection from insect feeding. 

7.0 References 

The following references, to the extent that they provide exemplary procedural or other 
30 details supplementary to those set forth herein, are specifically incorporated herein by reference. 
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CLAIMS: 

1 . An isolated polypeptide at least 85% identical to SEQ ID NO: 1 8, SEQ ID NO:20, SEQ ID 
NO:24, SEQ ID NO:40. or SEQ ID NO:44. 

2. An isolated polypeptide at least 91% identical to SEQ ID N0:2, SEQ ID N0:6. SEQ ID 
5 N0:8, SEQ ID NO: 10, SEQ ID NO: 18. SEQ ID NO:20, SEQ ID NO:24, SEQ ID NO:38, 

SEQ ID NO:40, or SEQ ID NO:44. 

3. An isolated polypeptide at least 95% identical to SEQ ID N0:2, SEQ ID NO:6, SEQ ID 
N0:8, SEQ ID NO: 10, SEQ ID NO: 16, SEQ ID NO: 18, SEQ IDNO:20, SEQ ID NO:22, 

I SEQ ID NO:24, SEQ ID NO:28, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ 

10 ID NO:44, or SEQ ID NO:50. 

4. An isolated polypeptide at least 99% identical to SEQ ID NO:2, SEQ ID NO:4, SEQ ID 
N0:6, SEQ ID N0:8, SEQ ID NO:10, SEQ ID N0:12, SEQ ID NO:14, SEQ ID N0:16, 
SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ 
ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID 

15 NO:38, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID 

NO:48, SEQ ID NO:50 or SEQ ID NO: 63. 

5. The polypeptide of claim 4, comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO:2, SEQ ID N0:4, SEQ ID N0:6, SEQ ID NO:8, SEQ ID 
NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID 

20 NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID 

NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID 
NO:40, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID 
NO:50 and SEQ ID NO: 63. 

6. An isolated nucleic acid sequence encoding the polypeptide of any preceding claim. 

25 7. A composition comprising the polypeptide of any of claims 1 to 5, and a diluent. 

8. The composition of claim 7, wherein the polypeptide is selected from the group consisting 
of SEQ ID NO:2, SEQ ID N0:4, SEQ ID N0:6, SEQ ID N0:8, SEQ ID NO: 10, SEQ ID 
N0:12, SEQ ID NO:14, SEQ ID N0:16. SEQ ID N0:18, SEQ ID NO:20, SEQ ID 
NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID 
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NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID 
NO:42, SEQ ID NO:44, SEQ ID NO:46, SEQ ID NO:48, SEQ ID NO:50 and SEQ ID 
NO: 63. 

9. The composition of claim 7, comprising a cell extract, cell suspension, cell homogenate, 
cell lysate, cell supernatant, cell filtrate, or cell pellet of Bacillus thuringiensis cells. 

10. The composition of claim 7, wherein said composition is a powder, dust, pellet, granule, 
spray, emulsion, colloid, or solution. 

11. The composition of claim 7, comprising from about 1% to ^bout 99% by weight of said 
polypeptide. 

12. An insecticidal polypeptide prepared by a process comprising the steps of: 

(a) culturing a Bacillus thuringiensis cell having the accession number NRRL B- 

21784, NRRL B-21783, NRRL B-21917, NRRL B-21786, NRRL B-21787, NRRL B- 

21785, NRRL B-21788, NRRL B-21915 or NRRL B-21916 under conditions effective to 
produce an insecticidal polypeptide; and 

(b) obtaining from said cell the insecticidal polypeptide so produced. 

13. A Bacillus thuringiensis cell having the NRRL accession number NRRL B-21784, 
NRRL B-21783, NRRL B-21917, NRRL B-21786, NRRL B-21787, NRRL B»21785, 
NRRL B-21788, NRRL B-21915 or NRRL B-21916. 

14. An isolated polynucleotide at least 85% identical to SEQ ID NO:17, SEQ ID N0:19, SEQ 
ID NO:23, SEQ ID NO:39, or SEQ ID NO:43. 

15. The polynucleotide of claim 1 5, wherein the polynucleotide is at least 95% identical to SEQ 
ID N0:1, SEQ ID N0:5, SEQ ID N0:7, SEQ ID NO:9, SEQ ID NO: 15, SEQ ID NO: 17, 
SEQ ID NO:19, SEQ ID N0:21, SEQ ID N0.23, SEQ ID NO:27, SEQ ID NO:35. SEQ 
ID NO:37, SEQ ID NO:39, SEQ ID NO:43, or SEQ ID NO:49. 

1 6. The polynucleotide of claim 1 5, wherein the polynucleotide is at least 99% identical to SEQ 
ID N0:1, SEQ ID N0:3, SEQ ID N0:5, SEQ ID NO:7, SEQ ID N0:9, SEQ ID NO:l 1, 
SEQ ID N0:13. SEQ ID N0:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID N0:21, SEQ 
ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID 
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NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID N0:41, SEQ ID 
NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49. or SEQ ID NO:62. 

17. The polynucleotide of claim 15, comprising the nucleic acid sequence of SEQ ID N0:1, 
SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:7, SEQ ID NO:9, SEQ ID N0:1 1, SEQ ID 

5 N0:13, SEQ ID NO:15, SEQ ID N0:17, SEQ ID N0:19, SEQ ID NO:21, SEQ ID 

NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID 
NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:41, SEQ ID 
NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, or SEQ ID NO:62. 

18. The polynucleotide of any of claims 14-17, wherein th^ isolated polynucleotide is 
10 provided in a vector. 

19. The polynucleotide of any of claims 14-17, wherein the isolated polynucleotide is 
operably linked to a promoter. 

20. The polynucleotide of claim 19, wherein the promoter is a plant-expressible promoter. 

21. The polynucleotide of claim 20, wherein the plzmt-expressible promoter is selected from 
15 the group consisting of com sucrose synthetase 1, com alcohol dehydrogenase 1, com 

light harvesting complex, com heat shock protein, pea small subunit RuBP carboxylase, 
Ti plasmid mannopine synthase, Ti plasmid nopaline synthase, petunia chalcone 
isomerase, bean glycine rich protein 1, Potato patatin, lectin, CaMV 35S, and the S-E9 
small subunit RuBP carboxylase promoter. 

20 22. The polynucleotide of claim 1 8, wherein the vector is a plasmid, baculovirus, artificial 
chromosome, virion, cosmid, phagemid, phage or viral vector. 

23. A transformed host cell comprising a nucleic acid sequence encoding the polypeptide of any 
of claims 1 to S. 

24. The transformed host cell of claim 23, wherein the nucleic acid is selected from the group 
25 consisting of SEQ ID NO:l, SEQ ID N0:3, SEQ ID N0:5, SEQ ID N0:7, SEQ ID NO:9, 

SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ 
ID N0:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO;27, SEQ ID NO:29, SEQ ID 
NO:31, SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39, SEQ ID 
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N0:41, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49. and SEQ ID 
NO:62. 

25. The transformed host cell of claim 23, further defined as a prokaryotic or eukaryotic host 
cell. 

5 26. The transformed host cell of claim 23, further defined as a bacterial cell or a plant cell. 

27. The transformed host cell of claim 26, wherein said bacterial cell is a Bacillus thuhngiemis. 
Bacillus subtilis. Bacillus megaterium, Bacillus cereus, Escherichia. Salmonella, 
Agrobacterium or Pseudomonas cell. 

28. The transformed host cell of claim 26, wherein said bacterial cell is a Bacillus thuringiensis 
10 NRRL B-21784, NRRL B-21783, NRIU. B-21917, NRRL B-21786, NRRL B-21787, 

NRRL B-21785, NRRL B-21788, NRRL B-21915 or NRRL B-21916 cell. 

29. The transformed host cell of claim 27, wherein said bacterial cell is an Agrobacterium 
tumefaciens cell. 

30. The transformed host cell of claini 26, further defined as a monocotyledonous or 
IS dicotyledonous plant cell. 

31. The transformed host cell of claim 30, wherein said plant cell is selected from the group 
consisting of a com, wheat, soybean, oat, cotton, rice, lye, sorghum, sugarcane, tomato, 
tobacco, kapok, flax, potato, barley, turf grass, pasture grass, berry, fruit, legume, vegetable, 
ornamental plant, shrub, cactus, succulent, and tree cell. 

20 32. The transformed host cell of claim 30, wherein said plant cell is a com, wheat, rice, or 
sugarcane cell. 

33. The transformed host cell of claim 30, wherein said plant cell is a soybean, cotton, potato, 
tomato, or tobacco cell. 

34. A transgenic plant having incorporated into its genome a selected polynucleotide comprising 
25 a first sequence region that encodes the polypeptide of any of claims 1 to 5. 

35. The transgenic plant of claim 34, wherein said first sequence region encodes SEQ ID N0:2, 
SEQ ID N0:4, SEQ ID N0:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID 
N0:14, SEQ ID NO:l6, SEQ ID N0:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID 
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NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID 
NO:34. SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42. SEQ ID 
NO:44, SEQ ID NO:46, SEQ ID NO:48. SEQ ID NO:50, or SEQ ID NO: 63. 
36. The transgenic plant of claim 34, wherein said first sequence region comprises SEQ ID 

5 NO: 1 , SEQ ID NO:3, SEQ ID N0:5, SEQ ID N0:7. SEQ ID N0:9, SEQ ID NO: 1 1 , SEQ 

ID NO:13, SEQ ID N0:15. SEQ ID N0:17. SEQ ID NO:19, SEQ ID NO:21, SEQ ID 
NO:23. SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID 
NO:33, SEQ ID NO:35, SEQ ID NO:37, SEQ ID NO:39. SEQ ID N0:41, SEQ ID 
NO:43, SEQ ID NO:45, SEQ ID NO:47, SEQ ID NO:49, or 3EQ ID NO:62. 

10 37. The transgenic plant of claim 34, further defined as a monocotyledonous plant. 

38. The transgenic plant of claim 34, further defined as a com, wheat, oat, rice, barley, turf 
grass, or pasture grass plant. 

39. The transgenic plant of claim 34, further defined as a dicotyledonous plant. 

40. The transgenic plant of claim 34, further defined as a legume, soybean, tobacco, tomato. 
1 5 potato, cotton, fruit, berry, vegetable or tree. 

41. A progeny of any generation of the transgenic plant of claim 34, wherein said progeny 
comprises said first selected sequence region. 

42. A seed of any generation of the plant of claim 34, wherein said seed comprises said first 
sequence region. 

20 43. A seed of any generation of the progeny of claim 39, wherein said seed comprises said first 
sequence region. 

44. A plant of any generation of the seed of claim 42 or 43, wherein said plant comprises said 
first sequence region. 

45. A method for controlling Lepdopteran insects comprising contacting said insect with the 
25 polypeptide of any of claims 1 to 5. 

46. The method of claim 45, wherein the polypeptide is provided in a powder, dust, pellet, 
granule, spray, emulsion, colloid, or solution. 

47. The method of claim 45, wherein the polypeptide is provided in a transformed host cell. 
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48. The method of claim 47, wherein the transformed host cell is a bacterial or plant cell. 

49. The method of claim 45, wherein the polypeptide is provided in a transgenic plant. 

50. The method of claim 49, wherein the plant is a com, cotton, or soybean plant. 

51. A method of preparing an insect resistant plant comprising: 

5 (a) contacting recipient plant cells with a polynucleotide composition 

comprising at least a first nucleic acid sequence encoding the polypeptide of any 
of claims 1 to S; 

(b) selecting a recipient plant cell comprising the first nucleic acid sequence; 
and 

10 (c) regenerating a plant from the selected cell; 

wherein said plant has enhanced insect resistance relative to the corresponding non- 
transformed plant. 
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8.0 Sequence Listing 

SEQUENCE LISTING 

5 <110> Baum, James A. 

Chu, Chih-Rei 
Donovan, William P. 
Gilmer, Amy J. 
Rupar, Mark J. 



10 



15 



20 



35 



40 



45 



50 



55 



<120> Lepidopteran-Active Bacillus thuringiensis 

Delta-Endotoxin Compositions and Methods of Use 

<130> meco201 

<140> 
<141> 

<160> 63 

<170> Patentin Ver. 2.1 



<210> 1 

<211> 1899 

25 <212> DNA 

<213> Bacillus thuringiensis 

<220> 
<221> CDS 
30 <222> (1) . . (1899) 

<400> 1 

atg aat aat gta tta aat aac gga aga act act att tgt gat gcg tat 

Met Asn Asn Val Leu Asn Asn Gly Arg Thr Thr He Cys Asp Ala Tyr 
15 10 15 

aat gta gtg gcc cat gat oca ttt agt ttt gag cat aaa tea tta gat 
Asn Val Val Ala His Asp Pro Phe Ser Phe Glu His Lys Ser Leu Asp 
20 25 30 



tat gta get cct ata gtc gga act gtt tct age ttt ctg eta aag aag 

Tyr Val Ala Pro He Val Gly Thr Val Ser Ser Phe Leu Leu Lys Lys 

50 55 60 

gtg ggg agt ctt att gga aaa agg ata ttg agt gaa tta tgg ggg tta 

Val Gly Ser Leu He Gly Lys Arg He Leu Ser Glu Leu Trp Gly Leu 

65 70 75 80 

ata ttt cct agt ggt age aea aat eta atg caa gat att tta agg gag 

He Phe Pro Ser Gly Ser Thr Asn Leu Met Gin Asp He Leu Arg Glu 

85 90 95 

aea gaa caa tte eta aat caa aga ctt aat aca gao act ctt gcc cgt 



48 



96 



acc ate cga aaa gaa tgg atg gag tgg aaa aga aca gat cat agt tta 144 
Thr He Arg Lys Glu Trp Met Glu Trp Lys Arg Thr Asp His Ser Leu 
35 40 45 



192 



240 



288 



336 
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Thr Glu Gin Phe Leu Asn Gin Arg Leu Asn Thr Asp Thr Leu Ala Arg 
100 105 110 

gta aat gcg gaa ttg gaa ggg ctg caa gcg aat ata agg gag ttt aat 384 
5 Val Asn Ala Glu Leu Glu Gly Leu Gin Ala Asn He Arg Glu Phe Asn 
115 120 125 



10 



caa caa gta gat aat ttt tta aat cot act caa aac cot gtt cct tta 
Gin Gin Val Asp Asn Phe Leu Asn Pro Thr Gin Asn Pro Val Pro Leu 
130 135 140 



etc aat gca gat gaa tgg gga att tea gca gca aca tta cgt acg tat 
25 Leu Asn Ala Asp Glu Trp Gly He Ser Ala Ala Thr Leu Arg Thr Tyr 
195 200 205 

caa aat tat ctg aaa aat tat aca aca gag tac tct aat tat tgt ata 
Gin Asn Tyr Leu Lys Asn Tyr Thr Thr Glu Tyr Ser Asn Tyr Cys He 
30 210 215 220 

aat acg tat caa act gcg ttt aga ggt tta aac acc cgt tta cac gat 

Asn Thr Tyr Gin Thr Ala Phe Arg Gly Leu Asn Thr Arg Leu His Asp 

225 230 235 240 

35 

atg tta gaa ttt aga aca tat atg ttt tta aat gta ttt gaa tat gta 

Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr Val 

245 250 255 

40 tct ate tgg teg ttg ttt aaa tat caa age ctt eta gta tct tct ggc 
Ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Leu Val Ser Ser Gly 
260 265 270 

get aat tta tat gca age ggt agt gga cea eag eag act caa tea ttt 
45 Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Gin Gin Thr Gin Ser Phe 
275 280 285 

act tea caa gac tgg cea ttt tta tat tct ctt ttc caa gtt aat tea 
Thr Ser Glh Asp Trp' Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn Ser 
50 290 295 300 

aat tat gtg tta aat ggc ttt agt ggc get aga ctt acg eag act ttc 

Asn Tyr Val Leu Asn Gly Phe Ser Gly Ala Arg Leu Thr Gin Thr Phe 

305 310 315 320 

55 

cct aat att ggt ggt tta cct ggt act act aca act cac gca ttg ctt 

Pro Asn He Gly Gly Leu Pro Gly Thr Thr Thr Thr His Ala Leu Leu 



432 



480 



528 



tea ata act tct tea gtt aat aca atg eag caa tta ttt eta aat aga 

Ser He Thr Ser Ser Val Asn Thr Met Gin Gin Leu Phe Leu Asn Arg 

145 150 155 160 

15 

tta ccc cag ttc cgt gtg caa gga tac caa ctg tta tta tta cct tta 

Leu Pro Gin Phe Arg Val Gin Gly Tyr Gin Leu Leu Leu Leu Pro Leu 

165 170 175 

20 ttt gca cag gca gee aat atg cat ctt tct ttt att aga gat gtt gtt 576 

Phe Ala Gin Ala Ala Asn Met His Leu Ser Phe He Arg Asp Val Val 

180 185 190 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



wo 01/19859 



PCT/USOO/25361 



10 



20 



30 



35 



40 



45 



50 



55 



325 330 335 

gcg gca agg gtc aat tac agt gga gga gtt teg tct ggt gat ata ggc 1056 
Ala Ala Arg Val Asn Tyr Ser Gly Gly Val Ser Ser Gly Asp lie Gly 
340 345 350 



get gtg ttt aat caa aat ttt agt tgt age aca ttt etc cca eet ttg 
Ala Val Phe Asn Gin Asn Phe Ser Cys Ser Thr Phe Leu Pro Pro Leu 
355 360 365 



15 ggt gtt aat ace gtt aea aat tgg caa aca gaa teg ttt gag tea act 
Gly Val Asn Thr Val Thr Asn Trp Gin Thr Glu Ser Phe Glu Ser Thr 
385 390 395 400 



tat tta aga gta tct tea eta gga aat tec act att cga gtt act ata 
Tyr Leu Arg Val Ser Ser Leu Gly Asn Ser Thr lie Arg Val Thr lie 
545 550 555 560 



1104 



tta aca cca ttt gtt agg agt tgg eta gat tea ggt tea gat cga ggg 1152 
Leu Thr Pro Phe Val Arg Ser Trp Leu Asp Ser Gly Ser Asp Arg Gly 
370 375 380 



1200 



tta ggt tta agg tgt ggt get ttt aea get egt ggt aat tea aac tat 1248 
Leu Gly Leu Arg . Cys Gly Ala Phe Thr Ala Arg Gly Asn Ser Asn Tyr 
405 410 415 



tte cea gat tat ttt ate cgt aat att tea gga gtt cct tta gtt gtt 1296 
Phe Pro Asp Tyr Phe lie Arg Asn He Ser Gly Val Pro Leu Val Val 
25 420 425 430 

aga aat gaa gat tta aga aga ceg tta eac tat aat gaa ata aga aat 1344 
Arg Asn Glu Asp Leu Arg Arg Pro Leu His Tyr Asn Glu lie Arg Asn 
435 440 445 

ata gaa agt cct tea gga aca cct ggt gga tta cga get tat atg gta 1392 
He Glu Ser Pro Ser Gly Thr Pro Gly Gly Leu Arg Ala Tyr Met Val 
450 455 460 

tct gtg cat aat aga aaa aat aat ate tat gee gtg cat gaa aat ggt 1440 
Ser val His Asn Arg Lys Asn Asn He Tyr Ala Val His Glu Asn Gly 
465 470 475 480 

act atg att cat tta gcg ceg gaa gat tat aca gga ttc ace ata teg 1488 
Thr Met He His Leu Ala Pro Glu Asp Tyr Thr Gly Phe Thr He Ser 
485 490 495 

ceg ata eat gca act caa gtg aat aat caa acg cga aca ttt att tct 1536 
Pro He His Ala Thr Gin Val Asn Asn Gin Thr Arg Thr Phe He Ser 
500 505 510 

gaa aaa ttt gga aat caa ggt gat tee tta aga ttt gaa caa age aac 
Glu Lys Phe Gly Asn Gin Gly Asp Ser Leu Arg Phe Glu Gin Ser Asn 

515 520 525 

acg aca gca cgt tat aca ctt aga gga aat gga aat agt tac aat ctt 1632 
Thr Thr Ala Arg Tyr Thr Leu Arg Gly Asn Gly Asn Ser Tyr Asn Leu 
530 535 540 



1584 



1680 
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10 



13 



aac ggt agg gtt tat act get tea aat gtt aat act act aca aat aac 1728 
Asn Gly Arg Val Tyr Thr Ala Ser Asn Val Asn Thr Thr Thr Asn Asn 
565 570 575 

gat gga gtt aat gat aat ggc get cgt ttt tta gat att aat atg ggt 1776 
Asp Gly Val Asn Asp Asn Gly Ala Arg Phe Leu Asp He Asn Met Gly 
580 585 590 

aat gta gta gca agt gat aat act aat gta ccg tta gat ata aat gtg 1824 
Asn Val Val Ala Ser Asp Asn Thr Asn Val Pro Leu Asp He Asn Val 
595 600 605 



aea ttt aac tec ggt act caa ttt gag ctt atg aat att atg ttt gtt 
Thr Phe Asn Ser Gly Thr Gin Phe Glu Leu Met Asn He Met Phe Val 
610 615 620 I 



cea act aat ctt cca cca ata tat taa 
Pro Thr Asn Leu Pro Pro He Tyr 
20 625 630 



<210> 2 
<211> 632 
25 <212> PRT 

<213> Bacillus thuringiensis 





<400> 2 


























Ala 






Met 


Asn 


Asn 


Val 


Leu 


Asn 


Asn 


Gly Arg 


Thr 


Thr 


He 


Cys 


Asp 


Tyr 


30 


1 








5 










10 










15 


T^p 




Asn 


Val 


Val 


Ala 
20 


His 


Asp 


Pro 


Phe 


Ser 
25 


Phe 


Glu 


His 


Lys 


Ser 
30 


Leu 




Thr 


He 


Arg 
35 


Lys 


Glu 


Trp 


Met 


Glu 
40 


Trp 


Lys 


Arg 


Thr 


Asp 
45 


His 


Ser 


Leu 


35 


Tyr 


Val 


Ala 


Pro 


He 


Val 


Gly 


Thr 


Val 


Ser 


Ser 


Phe 


Leu 


Leu 


Lys 


Lys 




50 










55 










60 












Val 


Gly 


Ser 


Leu 


He 


Gly 


Lys 


Arg 


He Leu Ser Glu Leu Trp Gly 


Leu 




65 








70 










75 










80 




He 


Phe 


Pro 


Ser 


Gly 


Ser 


Thr 


Asn 


lieu 


Met 


Gin Asp 


He Leu 


Arg 


Glu 


40 










85 










90 










95 






Thr 


Glu 


Gin 


Phe 
100 


Leu 


Asn 


Gin 


Arg 


Leu 
105 


Asn 


Thr 


Asp 


Thr 


Leu 
110 


Ala 


Arg 




val 


Asn 


Ala 
115 


Glu 


Leu 


Glu 


Gly 


Leu 
120 


Gin 


Ala 


Asn 


He 


Arg 
125 


Glu 


Phe 


Asn 


45 


Gin 


Gin 
130 


Val 


Asp 


Asn 


Phe 


Leu 
135 


Asn 


Pro 


Thr 


Gin 


Asn 
140 


Pro 


Val 


Pro 


Leu 




Ser 


He 


Thr 


Ser 


Ser 


Val 


Asn 


Thr 


Met 


Gin 


Gin 


Leu 


Phe 


Leu Asn Arg 




145 










150 










155 










160 




Leu 


Pro 


Gin 


Phe 


Arg 


Val 


Gin 


Gly Tyr 


Gin 


Leu 


Leu 


Leu 


Leu 


Pro 


Leu 


50 










165 










170 










175 


Val 


Phe 


Ala 


Gin 


Ala 

180 


Rla 


Asn 


Met 


His 


Leu 
185 


Ser 


Phe 


He 


Arg 


Asp 
190 


Val 




Leu 


Asn 


Ala 


Asp 


Glu 


Trp 


Gly 


He 


Ser 


Ala 


Ala 


Thr 


Leu 


Arg 


Thr 


Tyr 








195 










200 










205 






He 


55 


Gin 


Asn 


Tyr 


Leu 


Lys 


Asn 


Tyr 


Thr 


Thr 


Glu 


Tyr 


Ser 


Asn 


Tyr 


Cys 






210 








215 










220 












Asn 


Thr 


Tyr 


Gin 


Thr 


Ala 


Phe 


Arg 


Gly 


Leu 


Asn 


Thr 


Arg 


Leu 


His 


Asp 



1872 



1899 
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225 230 235 240 

Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr Val 

245 250 255 

Ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Leu Val Ser Ser Gly 
5 260 265 270 

Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Gin Gin Thr Gin Ser Phe 

275 280 285 

Thr Ser Gin Asp Trp Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn Ser 
290 295 300 

10 Asn Tyr Val Leu Asn Gly Phe Ser Gly Ala Arg Leu Thr Gin Thr Phe 
305 310 315 320 

Pro Asn He Gly Gly Leu Pro Gly Thr Thr Thr Thr His Ala Leu I*eu 

325 330 335 

Ala Ala Arg Val Asn Tyr Ser Gly Gly Val Ser Ser Gly Asp He Gly 
IS 340 345 ' 350 

Ala Val Phe Asn Gin Asn Phe Ser Cys Ser Thr Phe Leu Pro Pro Leu 

355 360 365 

Leu Thr Pro Phe Val Arg Ser Trp Leu Asp Ser Gly Ser Asp Arg Gly 
370 375 380 

20 Gly Val Asn Thr Val Thr Asn Trp Gin Thr Glu Ser Phe Glu Ser Thr 
385 390 395 400 

Leu Gly Leu Arg Cys Gly Ala Phe Thr Ala Arg Gly Asn Ser Asn Tyr 

405 410 415 

Phe Pro Asp Tyr Phe He Arg Asn He Ser Gly Val Pro Leu Val Val 
25 420 425 430 

Arg Asn Glu Asp Leu Arg Arg Pro Leu His Tyr Asn Glu He Arg Asn 

435 440 445 

He Glu Ser Pro Ser Gly Thr Pro Gly Gly Leu Arg Ala Tyr Met Val 
450 455 460 

30 Ser Val His Asn Arg Lys Asn Asn He Tyr Ala Val His Glu Asn Gly 
465 470 475 480 

Thr Met He His Leu Ala Pro Glu Asp Tyr Thr Gly Phe Thr He Ser 

485 490 495 

Pro He His Ala Thr Gin Val Asn Asn Gin Thr Arg Thr Phe He Ser 
35 500 505 510 

Glu Lys Phe Gly Asn Gin Gly Asp Ser Leu Arg Phe Glu Gin Ser Asn 

515 520 525 

Thr Thr Ala Arg Tyr Thr Leu Arg Gly Asn Gly Asn Ser Tyr Asn Leu 
530 535 540 

40 Tyr Leu Arg Val Ser Ser Leu Gly Asn Ser Thr He Arg Val Thr He 
545 550 555 560 

Asn Gly Arg Val Tyr Thr Ala Ser Asn Val Asn Thr Thr Thr Asn Asn 

565 570 575 

Asp Gly Val Asn Asp Asn Gly Ala Arg Phe Leu Asp He Asn Met Gly 
45 5 8 0 5 8 5 5 9 0 

Asn Val Val Ala Ser Asp Asn Thr Asn Val Pro Leu Asp He Asn Val 

595 600 605 

Thr Phe Asn Ser Gly Thr Gin Phe Glu Leu Met Asn He Met Phe Val 
610 615 620 

50 Pro Thr Asn Leu Pro Pro He Tyr 
625 630 



55 <210> 3 

<211> 729 
<212> DNA 
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-6 



<213> Bacillus thuringiensis 

ttcgctagga accaagccat ttctagatta gaaggactaa gcaatcttta tcaaatttac 60 
gcagaatctt ttagagagtg ggaagcagat cctactaatc cagcattaag agaagagatg 120 
cgtattcaat tcaatgacat gaacagtgcc cttacaaccg ctattcctct tttggcagtt 180 
caaaattatc aagttcctct tttatcagta tatgttcaag ctgcaaattt acatttatca 240 
gttttgagag atgtttcagt gtttggacaa aggtggggat ttgatgccgc gactatcaat 300 
agtcgttata atgatttaac taggcttatt ggcaactata cagattatgc tgtgcgctgg 360 
tacaatacgg gattagagcg tgtatgggga ccggattcta gagattgggt aaggtataat 420 
caatttagaa gagagctaac acttactgta ttagatatcg ttgctctatt ctcaaattat 480 
gatagtcgaa ggtatccaat tcgaacagtt tcccaattaa caagagaaat ttatacgaac 540 
ccagtattag aaaattttga tggtagtttt cgtggaatgg ctcagagaat agaacagaat 600 
attaggcaac cacatcttat ggatatcctt aatagtataa ccatttatac tgatgtgcat 660 
agaggcttta attattggtc agggcatcaa ataacagctt ctcctgtagg gttttcagga 720 
ccagaattc 



<210> 4 

20 <211> 243 

<212> PRT 

<213> Bacillus thuringiensis 



25 



30 



40 



45 



<400> 4 , , 

Phe Ala Arg Asn Gin Ala He Ser Arg Leu Glu Gly Leu Ser Asn Leu 
1 5 10 15 

Tvr Gin He Tyr Ala Glu Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr 
20 25 30 

Asn Pro Ala Leu Arg Glu Glu Met Arg He Gin Phe Asn Asp Met Asn 
35 40 45 



Ser Ala Leu Thr Thr Ala He Pro Leu Leu Ala Val Gin Asn Tyr Gin 

35 50 55 60 

Val Pro Leu Leu Ser Val Tyr Val Gin Ala Ala Asn Leu His Leu Ser 

65 70 75 80 



val Leu Arg Asp Val Ser Val Phe Gly Gin Arg Trp Gly Phe Asp Ala 
85 90 95 

Ala Thr He Asn Ser Arg Tyr Asn Asp Leu Thr Arg Leu He Gly Asn 

100 105 HO 

Tvr Thr Asp Tyr Ala Val Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val 

115 120 125 



Trp Gly Pro Asp Ser Arg Asp Trp Val Arg Tyr Asn Gin Phe Arg Arg 
50 130 135 140 

Glu Leu Thr Leu Thr Val Leu Asp He Val Ala Leu Phe Ser Asn Tyr 

145 150 155 160 



55 



Asp Ser Arg Arg Tyr Pro He Arg Thr Val Ser Gin Leu Thr Arg Glu 
165 170 175 
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Ile Tyr Thr Asn Pro Val Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly 
180 185 190 

Met Ala Gin Arg lie Glu Gin Asn lie Arg Gin Pro His Leu Met Asp 
S 195 200 205 

lie Leu Asn Ser lie Thr He Tyr Thr Asp Val His Arg Gly Phe Asn 
210 215 220 

10 Tyr Trp Ser Gly His Gin He Thr Ala Ser Pro Val Gly Phe Ser Gly 
225 230 235 240 

Pro Glu Phe 

IS 



20 



<210> 5 
<211> 1959 
<212> DNA 

<213> Bacillus thuringiensis 



<400> 5 

gaaaatgaga 

caaatggatc 

25 aatatcaatc 
atattgggcg 
gttggggaat 
caacttataa 
ggtctaggaa 

30 aatgatgcaa 
actactgcta 
gctcaagctg 
tgggggatgg 
gaatattcta 

35 aatgctgaaa 
gatttagtag 
cagttaacaa 
gcaagtacga 
ttcaggcctc 

40 cgttggagta 
ataggaggga 
gtaacattac 
aatatactat 
ctcagaatat 

45 ttcaattatt 
aatcatatag 
gtctattctt 
actcaaattc 
ggatttactg 

SO gggtatattg 
cgttatgctt 
acgaacacat 
tatgttgaaa 
tttagtgcaa 

55 ttcgaggtag 



atgaaattat 

tatcgctaga 

cacttgtCag 

tattaggtgt 

tatggcctag 

gacaacaagt 

gaggctatag 

gatcaagaag 

taccgctttt 

caaatttaca 

catcttccga 

accattgcgt 

gttggttgcg 

ccctattccc 

gagaaattta 

attggtttaa 

cgcatctact 

gcactcaaca 

cattaaatac 

attacgtttc 

ttactacfccc 

ttatgaaaga 

tgattcagaa 

tcatagatat 

ggacgcatcg 

ctgcagtgaa 

gtggagacgt 

aagttccaat 

ctgtaacctc 

taccagcaac 

tcaacaatgc 

atgcagaagt 

aatatgattt 



aaatgcctta 

tgctcgtatt 

cgcatcaaca 

gccgtttgct 

tggtagagat 

aacagaaaat 

atcttaccag 

cattattctt 

cagaatacga 

cctattatta 

tgttaaccaa 

acaatggtat 

gtataatcaa 

aagctatgat 

tacagatcca 

taataatgca 

tgattttcca 

tatgaattat 

ctcaacacaa 

gtctcgtgac 

tgtgaatgga 

ggcgccacta 

actgaattac 

ctcatataga 

tagtgcagat 

gggaagattt 

agttagattg 

tcaattcacg 

gattgagctc 

agctgcatca 

ttttacatcc 

aataatagac 

agaaagagca 



tcgattccag 

gaggattctt 

gtccaaacgg 

ggacaactag 

ccatgggaaa 

actaggaata 

caggctcttg 

gagcgctatg 

aatgaagaag 

ttgagagacg 

tattaccagg 

aatacagggc 

ttccgtagag 

actcgcactt 

attgggagaa 

ccatcgtttt 

gaacaactta 

tgggtgggac 

ggacttacta 

gtttatagaa 

gtaccttggg 

cctacagtca 

caccagaaac 

ctaatcatag 

cgtacgaata 

ctttttaatg 

aataggaata 

tcgacatcta 

aatgttaatt 

ttagataatc 

gcaacaggta 

agatttgaat 

caaaaggcg 



ctgtatcgaa 
tgtgtatagc 
gtataaacat 
ctagttttta 
ttttcctgga 
cggctattgc 
aaacttggtt 
ttgctttaga 
ttccattatt 
catccctttt 
aacaaatcag 
taaataactt 
acctaacgtt 
atccaatcaa 
caaatgcacc 
ctgccataga 
caatttacag 
ataggcttaa 
ataatacttc 
cagaatcaaa 
ctagatttaa 
accgtatcag 
aacagaacga 
gaaacacttt 
cgattggacc 
gttctgtgat 
atggtaatat 
ccagatatcg 
tgggcaattc 
tacaatcagg 
atatagtagg 
ttatcccagt 



tcattccgca 
cgaggggaat 
agctggtaga 
tagttttctt 
atatgtagaa 
tcgattagaa 
agataaccga 
acttgacatt 
aatggtatat 
tggtagtgaa 
atatacagag 
aagagggaca 
aggggtatta 
tacgagtgct 
ttcaggattt 
ggctgccatt 
tgcatcaagc 
cttccgccca 
aattaatcct 
tgcagggaca 
ttttataacc 
ggagttggga 
ccaaattatg 
gagagcacca 
aaatagaatt 
ttcaggacca 
ccaaaataga 
agttcgagta 
atcaattttt 
ggattttggt 
tgctagaaat 
tactgcaacc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1959 
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<210> 6 
<211> 653 
<2X2> PRT 

<213> Bacillus thuringiensis 

5 

<400> 6 

Glu Asn Glu Asn Glu He lie Asn Ala Leu Ser He Pro Ala val Ser 
X 5 10 15 

10 Asn His ser Ala Gin Met Asp Leu Ser Leu Asp Ala Arg He Glu Asp 
20 25 30 

Ser Leu Cys He Ala Glu Gly Asn Asn He Asn Pro Leu Val Ser Ala 
35 40 45 

Ser Thr Val Gin Thr Gly He Asn He Ala Gly Arg He Leu Gly Val 
50 55 60 

Leu Gly val Pro Phe Ala Gly Gin Leu Ala Ser Phe Tyr Ser Phe Leu 
20 65 70 75 80 

Val Gly Glu Leu Trp Pro Ser Gly Arg Asp Pro Trp Glu He Phe Leu 
85 90 95 

25 Glu Tyr Val Glu Gin Leu He Arg Gin Gin Val Thr Glu Asn Thr Arg 
100 105 110 

Asn Thr Ala He Ala Arg Leu Glu Gly Leu Gly Arg Gly Tyr Arg Ser 
115 - 120 125 

^° Tyr Gin Gin Ala Leu Glu Thr Trp Leu Asp Asn Arg Asn Asp Ala Arg 
130 135 140 

Ser Arq Ser He He Leu Glu Arg Tyr Val Ala Leu Glu Leu Asp He 
35 145 150 155 160 ^ 

Thr Thr Ala He Pro Leu Phe Arg He Arg Asn Glu Glu Val Pro Leu 
165 170 175 

40 Leu Met val Tyr Ala Gin Ala Ala Asn Leu His Leu Leu Leu Leu Arg 
180 185 190 

Asp Ala ser Leu Phe Gly Ser Glu Trp Gly Met Ala Ser Ser Asp Val 
195 200 205 

Asn Gin Tyr Tyr Gin Glu Gin He Arg Tyr Thr Glu Glu Tyr Ser Asn 
210 215 220 

His Cys Val Gin Trp Tyr Asn Thr Gly Leu Asn Asn Leu Arg Gly Thr 
50 225 230 235 240 

Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gin Phe Arg Arg Asp Leu Thr 
245 250 255 

55 Leu Gly val Leu Asp Leu Val Ala Leu Phe Pro Ser Tyr Asp Thr Arg 
260 265 270 
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Thr Tyr Pro He Asn Thr Ser Ala Gin Leu Thr Arg Glu He Tyr Thr 
275 280 285 

ASP Pro He Gly Arg Thr Asn Ala Pro Ser Gly Phe Ala Ser Thr Asn 
5 290 295 300 

Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala He Glu Ala Ala He 
305 310 315 320 

10 Phe Arg Pro Pro His Leu Leu Asp Phe Pro Glu Gin Leu Thr He Tyr 
325 330 335 

ser Ala Ser Ser Arg Trp Ser Ser Thr Gin His Met Asn Tyr Trp Val 
340 345 350 

\ Gly His Arg Leu Asn Phe Arg Pro He Gly Gly Thr Leu Asi^ Thr Ser 

' 355 360 3S5 

Thr Gin Gly Leu Thr Asn Asn Thr Ser He Asn Pro Val Thr Leu His 
20 370 375 380 

Tvr val ser Ser Arg Asp Val Tyr Arg Thr Glu Ser Asn Ala Gly Thr 
385 390 395 400 

25 Asn He Leu Phe Thr Thr Pro Val Asn Gly Val Pro Trp Ala Arg Phe 
405 410 . 415 

Asn Phe He Thr Leu Arg He Phe Met Lys Glu Ala Pro Leu Pro Thr 
420 425 430 

val Asn Arg He Arg Glu Leu Gly Phe Asn Tyr Leu He Gin Lys Leu 
435 440 445 

Asn Tyr His Gin Lys Gin Gin Asn Asp Gin He Met Asn His He Val 
35 4 5 0 4 5 5 4 60 

He ASP He Ser Tyr Arg Leu He He Gly Asn Thr Leu Arg Ala Pro 

Ann 475 480 

465 470 

40 val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn Thr He Gly 
485 490 495 

Pro Asn Arg He Thr Gin He Pro Ala Val Lys Gly Arg Phe Leu Phe 
500 505 510 

''^ Asn Gly ser Val He Ser Gly Pro Gly Phe Thr Gly Gly Asp Val Val 
515 520 525 

Arg Leu Asn Arg Asn Asn Gly Asn He Gin Asn Arg Gly Tyr He Glu 
50 530 535 540 

val Pro He Gin Phe Thr Ser Thr Ser Thr Arg Tyr Arg Val Arg Val 
545 550 555 560 

55 Ar«? Tyr Ala Ser Val Thr Ser He Glu Leu Asn Val Asn Leu Gly Asn 
565 570 575 
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Ser Ser lie Phe Thr Asn Thr Leu Pro Ala Thr Ala Ala Ser Leu Asp 
580 585 590 

Asn Leu Gin Ser Gly Asp Phe Gly Tyr Val Glu lie Asn Asn Ala Phe 
S 595 600 605 

Thr Ser Ala Thr Gly Asn lie Val Gly Ala Arg Asn Phe Ser Ala Asn 
610 615 620 

10 Ala Glu Val lie lie Asp Arg Phe Glu Phe lie Pro Val Thr Ala Thr 
625 630 635 640 

Phe Glu Val Glu Tyr Asp Leu Glu Arg Ala Gin Lys Ala 
645 650 

IS 

<210> 7 
<211> 328 
<212> DNA 
20 <213> Bacillus thuringiensis 

<400> 7 

ctttacagga agattaccac aaagttatta tatcgtttcc gttatgcttc gggagcaaat 60 
aggagtggtt cattaagtta ttcacagcaa acttcgtatg taatttcatt tccaaaaact 120 
25 atggacgcag gtgaaccact aacatctcgt tcgttcgctt ttacaacaac cgtcactcca 180 
atagccttta cacgagctca agaagaattt gatttataca tccaacagaa tgtttatata 240 
gatagagttg aatttatccc agtagatgca acatttgagg caaaatctga tttagaaaga 300 
gcgaaaaagg cggtgaatgc cttgttta 328 

30 

<210> 8 
<211> 109 
<212> PRT 

<213> Bacillus thuringiensis 

<4oo> 8 

Leu Tyr Arg Lys He Thr Thr Lys Leu Leu Tyr Arg Phe Arg Tyr Ala 
15 10 15 

40 Ser Gly Ala Asn Arg Ser Gly Ser Leu Ser Tyr Ser Gin Gin Thr Ser 
20 25 30 

Tyr Val He Ser Phe Pro Lys Thr Met Asp Ala Gly Glu Pro Leu Thr 
35 40 45 



35 



45 



Ser Arg Ser Phe Ala Phe Thr Thr Thr Val Thr Pro He Ala Phe Thr 
50 55 60 



Arg Ala Gin Glu Glu Phe Asp Leu Tyr He Gin Gin Asn Val Tyr He 
50 6 5 7 0 7 5 8 0 

Asp Arg Val Glu Phe lie Pro Val Asp Ala Thr Phe Glu Ala Lys Ser 
85 90 95 

55 Asp Leu Glu Arg Ala Lys Lys Ala Val Asn Ala Leu Phe 
100 105 
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<210> 9 
<211> 340 
<212> DNA 

<213> Bacillus thuringiensis 
<400> 9 

ttacgagtaa cctttacagg aagattacca caaagttatt 
tcgggagcaa ataggagtgg ttcattaagt tattcacagc 
tttccaaaaa ctatggacgc aggtgaacca ctaacatctc 
accgtcactc caataacctt tacacgagct caagaagaat 
aatgtttata tagatagagt tgaatttatc ccagtagatg 
gaCttagaaa gagcgaaaaa ggcggtgaat gccttgttta 



atatacgttt 
aaacttcgta 
gttcgttcgc 
ttgatttata 
caacatttga 



ccgttatgct 60 
tgtaatttca 120 
ttttacaaca 180 
catccaacag 24 0 
ggcaaaatct 300 
340 



IS 



20 



25 



30 



35 



40 



<210> 10 
<211> 113 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 10 

Leu Arg Val Thr Phe Thr Giy Arg Leu Pro Gin Ser Tyr 
15 10 

Phe Arg Tyr Ala Ser Gly Ala Asn Arg Ser Gly Ser Leu 
20 25 

Gin Gin Thr Ser Tyr Val lie Ser Phe Pro Lys Thr Met 
35 40 45 

Glu Pro Leu Thr Ser Arg Ser Phe Ala Phe Thr Thr Thr 
50 55 60 

lie Thr Phe Thr Arg Ala Gin Glu Glu Phe Asp Leu Tyr 
65 70 75 

Asn Val Tyr He Asp Arg Val Glu Phe He Pro Val Asp 
85 90 

Glu Ala Lys Ser Asp Leu Glu Arg Ala Lys Lys Ala Val 
100 105 



Tyr He Arg 
15 

Ser Tyr Ser 
30 

Asp Ala Gly 
Val Thr Pro 



He Gin Gin 
80 

Ala Thr Phe 
95 

Asn Ala Leu 
110 



45 



Phe 



<210> 11 

<211> 306 

<212> DNA 

50 <213> Bacillus thuringiensis 

<400> 11 

gtatcgcgtg agatcgtatg ctctacgaca gatttacaat 

actactatta atattggtaa tttctcgagc actatggaca 

55 ggaagattca gggttgcagg ttttactact ccatttacct 

ttcacaatag gtgcttttgg cttctctcca aacaacgaag 

tttgtcccgg cagaagtaac atttgaggca gaatatgatt 



tctatacgaa 
gtggggatga 
tttcagatgc 
tttatataga 
tagagaaagc 



tattaatgga 60 
tttacagtac 120 
aaacagcaca 180 
tcgaattgaa 240 
tcagaaagcg 300 
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gtgaat 



<210> 12 
5 <211> 102 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 12 

10 Val Ser Arg Glu lie Val Cys Ser Thr Thr Asp Leu Gin Phe Tyr Thr 
1 5 10 15 

Asn lie Asn Gly Thr Thr lie Asn lie Gly Asn Phe Ser Ser Thr Met 
20 25 30 



IS 



30 



Asp Ser Gly Asp Asp Leu Gin Tyr Gly Arg Phe Arg Val Ala Gly Phe 
35 40 45 



Thr Thr Pro Phe Thr Phe Ser Asp Ala Asn Ser Thr Phe Thr lie Gly 

20 5 0 5 5 6 0 

Ala Phe Gly Phe Ser Pro Asn Asn Glu Val Tyr He Asp Arg He Glu 
65 70 75 80 

25 Phe Val Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr Asp Leu Glu Lys 

85 90 95 



Ala Gin Lys Ala Val Asn 
100 



3 06 



<210> 13 
<211> 279 
<212> DNA 
35 <213> Bacillus thuringiensis 

<400> 13 

caattccata catcaattga cggaagacct attaatcagg ggaatttttc agcaactatg 60 

agtagtggga gtaatttaca gtccggaagc tttaggactg taggttttac tactccgttt 120 

40 aacttttcaa atggatcaag tgtatttacg ttaagtgctc atgtcttcaa ttcaggcaat 180 

gaagtttata tagatcgaat tgaatttatt ccggcagaag taacctttga ggcagaatat 240 

gatttagaaa gagcacaaaa ggcggtgaat gagctgttt 279 



45 <210> 14 
<211> 93 

<212> PRT 

<213> Bacillus thuringiensis 
50 <400> 14 

Gin Phe His Thr Ser He Asp Gly Arg Pro He Asn Gin Gly Asn Phe 
15 10 15 

Ser Ala Thr Met Ser Ser Gly Ser Asn Leu Gin Ser Gly Ser Phe Arg 
55 20 25 30 



Thr Val Gly Phe Thr Thr Pro Phe Asn Phe Ser Asn Gly Ser Ser Val 
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40 



45 



50 



55 



13- 



35 40 45 

Phe Thr Leu Ser Ala His Val Phe Asn Ser Gly Asn Glu Val Tyr lie 
50 55 60 

Asp Arg lie Glu Phe He Pro Ala Glu Val Thr Phe Glu Ala Glu Tyr 
65 70 75 80 

Asp Leu Glu Arg Ala Gin Lys Ala Val Asn Glu Leu Phe 
85 90 



<210> 15 
<211> 397 
15 <212> DNA 

<213> Bacillus thuringiensis 

e400> 15 

aggaccaggt tttacaggtg ggatatcctt cgaagaacga atgttggtag ctttggagat 60 

20 atgcgtgtaa acattactgc accactatca caaagatatc gcgtaagaat tcgctatgct 120 

tctacgacag atttacaatt tttcacgaga atcaatggaa cttctgtaaa tcaaggtaat 180 

ttccaaagaa ctatgaatag agggggtaat ttagaatctg gaaactttag gactgcagga 240 

tttagtacgc cttttagttt tttcaaatgc gcaaagtaca ttcacattgg gtactcaggc 300 

ttttcaaatc aggaagttta tatagatcga attgaatttg tcccggcaga agtaacattc 360 

25 gaggcagaat ctgatttgga aagagcgcaa aaggcgg 397 

<210> 16 

<211> 132 

30 <212> PRT 

<213> Bacillus thuringiensis 



<400> 16 

Arg Thr Arg Phe Tyr Arg Trp Asp He Leu Arg Arg Thr Asn Val Gly 
15 10 15 

Ser Phe Gly Asp Met Arg Val Asn He Thr Ala Pro Leu Ser Gin Arg 
20 25 30 

Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr Asp Leu Gin Phe Phe 
35 40 45 

Thr Arg He Asn Gly Thr Ser Val Asn Gin Gly Asn Phe Gin Arg Thr 
50 55 60 

Met Asn Arg Gly Gly Asn Leu Glu Ser Gly Asn Phe Arg Thr Ala Gly 
65 70 75 80 

Phe Ser Thr Pro Phe Ser Phe Phe Lys Cys Ala Lys Tyr He His He 
85 90 95 

Gly Tyr Ser Gly Phe Ser Asn Gin Glu Val Tyr He Asp Arg He Glu 

100 105 110 

Phe Val Pro Ala Glu Val Thr Phe Glu Ala Glu Ser Asp Leu Glu Arg 
lis 120 125 



BNSDOCID: <WO 0119859A2J_ 



wo 01/19859 



PCTAJSOO/25361 




5 <2105. 17 

<211> 123 

<212> DNA 

<213> Bacillus thuringiensis 



10 <400> 17 

ataatctaca atcaggggga ttttggttat gttgaaatca acaatgcttt tacatccgca 60 
acaggtaata tagtaggtgc tagaaatttt acgtgcaaat gcagaagtaa taatagacag 120 
att 



15 

<210> 18 I 
<211> 41 
<212> PRT 

<213> Bacillus thuringiensis 

20 

<400> 18 

lie lie Tyr Asn Gin 61y Asp Phe Gly Tyr Val Glu lie Asn Asn Ala 
15 10 15 

25 Phe Thr Ser Ala Thr Gly Asn lie Val Gly Ala Arg Asn Phe Thr Cys 
20 25 30 



30 



Lys Cys Arg Ser Asn Asn Arg Gin lie 
35 40 



<210> 19 
<211> 192 
<212> DHA 
35 <213> Bacillus thuringiensis 

<400> 19 

agttattata tacgtttccg ttatgcttcc 
tcacagccaa cttcatatgg aatttccttt 
40 acatctcgtt catttgcact tgctacactt 
gaattaaatc ta 



gtagctaata ggagtggtat atttagctat 60 
ccaaaaacta tggatgcaga tgaatcatta 120 
gctacaccgc taacctttag aaggcaagaa 180 

192 



<210> 20 
45 <211> 64 
<212> PRT 

<213> Bacillus thuringiensis 



50 



55 



<400> 20 ' 

Ser Tyr Tyr lie Arg Phe Arg Tyr Ala Ser Val Ala Asn Arg Ser Gly 
15 10 15 

lie Phe Ser Tyr Ser Gin Pro Thr Ser Tyr Gly He Ser Phe Pro Lys 
20 25 30 

Thr Met Asp Ala Asp Glu Ser Leu Thr Ser Arg Ser Phe Ala Leu Ala 
35 40 45 
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<210> 21 
10 <2ai> 3507 
<212> DMA 

<213> Bacillus thuringiensis 

<220> 
15 <221> CDS 

<222> (1) . . (3507) 



20 



25 



35 



40 



atrga^ata aat aat cag aac caa tgc ata cca tat aat tgc tta agt 48 
Mel 111 lie Asn Asn Gin Asn Gin Cys He Pro Tyr Asn Cys Leu Ser 
1 5 10 



aat cct gag gaa gta ttt ttg gat ggg gag agg ata tta cct gat ate 
tin Pro llu llu ?al Phe Leu Asp Gly Glu Arg He Leu Pro Asp He 



20 



25 30 



gat cca etc gaa gtt tct ttg teg ctt ttg caa ttt ctt ttg aat aac 
isp Pro Leu Glu Val Ser Leu Ser Leu Leu Gin Phe Leu Leu Asn Asn 
35 40 



50 



55 60 



ggg get ttg aga cca tct gat tgg gaa tta ttt ctt gca cag att gaa 
III Ala Leu A?g Pro Ser Asp Trp Glu Leu Phe Leu Ala Gin He Glu 
65 70 75 80 

cag ttg att gat cga aga ata gaa aga aca gta aga gca aaa gca ate 
Gin Leu He Asp Arg Arg He Glu Arg Thr Val Arg Ala Lys Ala He 
85 90 95 



96 



144 



45 

'° ttt gtt cca ggg ggg ggg ttt att tea gga tta ctt gat aaa ata tgg 192 
Phe Val Pro Gly Gly Gly Phe He Ser Gly Leu Leu Asp Lys He Trp 



240 



288 



336 



384 



get gaa tta gaa ggt tta ggg aga agt tat caa eta tat gga gag gca 
III lln Leu Glu Gly Leu Gly Arg Ser Tyr Gin Leu Tyr Gly Glu Ala 
45 100 105 110 

ttt aaa qag tgg gaa aaa act cca gat aac aca geg get egg tct aga 
Phe Lys Glu Trp Glu Lys Thr Pro Asp Asn Thr Ala Ala Arg Ser Arg 
115 120 125 

eta act gag aga ttt cgt ata att gat get caa att gaa gca aat ate 432 
?al ?hr 111 Arg Phe Arg He He Asp Ala Gin He Glu Ala Asn He 
130 135 140 

55 cct teg ttt egg gtt tec gga ttt gaa gtg cca ctt eta ttg gtt tat 480 
Pro ser Phe Arg Val Ser Gly Phe Glu Val Pro Leu Leu Leu Val Tyr 
1^5 150 155 160 
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acc caa gca get aat ttg cat etc get eta tta aga gat tct gtt gtt 528 
Thr Gin Ala Ala Asn Leu His Leu Ala Leu Leu Arg Asp Ser Val Val 
165 170 175 

5 

ttt gga gag aga tgg gga ttg acg act aca aat gtc aat gat ate tat 576 
Phe Gly Glu Arg Trp Gly Leu Thr Thr Thr Asn Val Asn Asp lie Tyr 
180 185 190 

10 aat aga caa gtt aat aga att ggt gaa tat age aag cat tgt gta gat 624 
Asn Arg Gin Val Asn Arg He Gly Glu Tyr Ser Lys His Cys Val Asp 
195 200 205 

acg tat aaa aca gaa tta gaa cgt eta gga ttt aga tct ata geg caa 672 
15 Thr Tyr Lys Thr Glu Leu Glu Arg Leu Gly Phe Arg Ser He Ala Gin 
210 215 220 ' 

tgg aga ata tat aat eag ttt aga agg gaa ttg aca eta acg gta tta 720 
Trp Arg He Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val Leu 
20 225 230 235 240 

gat att gtc get gtt ttc ccg aac tat gat agt aga ctg tat ceg att 768 

Asp He Val Ala Val Phe Pro Asn Tyr Asp Ser Arg Leu Tyr Pro He 
245 250 255 

25 

cga aca att tet caa ttg aca aga gaa att tat aca tec eca gta age 816 

Arg Thr He Ser Gin Leu Thr Arg Glu He Tyr Thr Ser Pro Val Ser 
260 265 270 

30 gaa ttt tat tat ggt gtc att aat agt aat aat ata att ggt acc ett 864 
Glu Phe Tyr Tyr Gly Val He Asn Ser Asn Asn He He Gly Thr Leu 
275 280 285 

act gaa eag caa ata agg cga cca cat ett atg gac ttc ttt aac tec 912 
35 Thr Glu Gin Gin He Arg Arg Pro His Leu Met Asp Phe Phe Asn Ser 
290 295 300 

atg ate atg tat acg tea gat aat aga cga gaa cat tat tgg tea gga 960 
Met He Met Tyr Thr Ser Asp Asn Arg Arg Glu His Tyr Trp Ser Gly 
40 305 310 315 320 

ett gaa atg acg get act aat act gag gga eat caa agg tea ttc ect 1008 

Leu Glu Met Thr Ala Thr Asn Thr Glu Gly His Gin Arg Ser Phe Pro 

325 330 335 

45 

tta get ggg act ata ggg aat tea get eea cea gta act gtt aga aat 1056 

Leu Ala Gly Thr He Gly Asn Ser Ala Pro Pro Val Thr Val Arg Asn 

340 345 350 

50 aat ggt gag gga att tat aga ata tta teg gaa cca ttt tat tea gca 1104 
Asn Gly Glu Gly He Tyr Arg He Leu Ser Glu Pro Phe Tyr Ser Ala 
355 360 365 

ect ttt eta ggc aca agt gtg eta gga agt cgt ggg gaa gaa ttt get 1152 
55 Pro Phe Leu Gly Thr Ser Val Leu Gly Ser Arg Gly Glu Glu Phe Ala 
370 375 380 
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ttt gca tct aat act act aca agt ctg cca tct aca ata tat aga aat 1200 
Phe Ala Ser Asn Thr Thr Thr Ser Leu Pro Ser Thr lie Tyr Arg Asn 
385 390 395 400 

5 cgt gga aca gta gat tea tta gtc age ata ccg cca cag gat tat age 1248 
Arg Gly Thr Val Asp Ser Leu Val Ser lie Pro Pro Gin Asp Tyr Ser 
405 410 415 

gta cca ccg cae agg ggg tat agt cat tta tta agt cac gtt acg atg 1296 
10 Val Pro Pro His Arg Gly Tyr Ser His Leu Leu Ser His Val Thr Met 
420 425 430 

cgc aat agt tct cct ata ttc cac tgg aca cat cgt agt gca acc cct 1344 
Arg Asn Ser Ser Pro He Phe His Trp Thr His Arg Ser Ala Thr Pro 
15 435 440 445 

aga aat aca att gat cca gat agt ate act caa att cca gca gtt aag 1392 
Arg Asn Thr He Asp Pro Asp Ser He Thr Gin He Pro Ala Val Lys 
450 455 460 

20 

gga gcg tat att ttt aat agt cca gtc att act ggg cca gga cat aca 1440 
Gly Ala Tyr He Phe Asn Ser Pro Val He Thr Gly Pro Gly His Thr 
465 470 475 480 

25 ggt ggg gat ata ata agg ttt aac cct aat act cag aac aac ata aga 1488 
Gly Gly Asp He He Arg Phe Asn Pro Asn Thr Gin Asn Asn He Arg 
485 490 495 

att cca ttt caa tea aat gcg gta cag cgt tat ega att aga atg cgt 1536 
30 He Pro Phe Gin Ser Asn Ala Val Gin Arg Tyr Arg He Arg Met Arg 
500 505 510 

tat gcg gca gaa get gat tgt att tta gaa agt gga gta aac att gtt 1584 
Tyr Ala Ala Glu Ala Asp Cys He Leu Glu Ser Gly Val Asn He Val 
35 515 520 525 

act ggg gca ggg gtc acc ttt agg cca att cct att aaa get aca atg 1632 
Thr Gly Ala Gly Val Thr Phe Arg Pro He Pro He Lys Ala Thr Met 
530 535 540 

40 

act cct gga agt cct tta aca tat tac age ttc cag tat gca gat tta 1680 

Thr Pro Gly Ser Pro Leu Thr Tyr Tyr Ser Phe Gin Tyr Ala Asp Leu 
545 550 555 560 

45 aat ata aat ctt act gcg ccg ata aga cct aat aat ttt gta tct att 1728 
Asn He Asn Leu Thr Ala Pro He Arg Pro Asn Asn Phe Val Ser He 
565 570 575 

aga cgt tea aac caa cca gga aac ctt tat ata gat aga att gaa ttc 1776 
50 Arg Arg Ser Asn Gin Pro Gly Asn Leu Tyr He Asp Arg He Glu Phe 
580 585 590 

att cca att gac cca ate cgt gag gca gaa cat gat tta gaa aga gcg 1824 
He Pro He Asp Pro He Arg Glu Ala Glu His Asp Leu Glu Arg Ala 
55 595 600 605 

caa aag gcg gtg aat gcg ctg ttt act tct tec aat caa eta gga tta 1872 
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10 



IS 



20 



25 



30 



35 



40 



45 



50 



55 



Gin Lys Ala Val Asn Ala Leu Phe Thr Ser Ser Asn Gin Leu Gly Leu 
610 615 620 

aaa aca gat gtg acg gat tat cat att gat caa gtg tec aat tta gtt 1920 
Lys Thr Asp Val Thr Asp Tyr His He Asp Gin Val Ser Asn Leu Val 
625 630 635 640 

gcg tgt tta teg gat gaa tte tgc ctg gat gaa aag cga gaa ttg tec 1968 
Ala Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser 
645 650 655 

gag aaa gtt aaa cat gcg aag cga etc agt gat gag aga aat tta etc 2016 
Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu 
660 665 670 

caa gat caa aac ttt aca ggc ate aat agg caa gta gac cgt' ggg tgg 
Gin Asp Gin Asn Phe Thr Gly He Asn Arg Gin Val Asp Arg Gly Trp 
675 680 685 

aga gga agt acg gat att ace ate caa gga ggg aat gat gta tte aaa 2112 
Arg Gly Ser Thr Asp He Thr He Gin Gly Gly Asn Asp Val Phe Lys 
690 695 700 



2064 



gag aat tac gtc aca eta cea ggt ace ttt gat gag tgt tac cca acg 
Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr Pro Thr 
705 710 715 720 

tat ttg tat caa aaa ata gat gag tea aaa tta aaa cet tat act egc 
Tyr Leu Tyr Gin Lys He Asjp Glu Ser Lys Leu Lys Pro Tyr Thr Arg 
725 730 735 



2160 



2208 



tat gaa tta aga ggg tat att gaa gat agt caa gac tta gaa gtc tat 22 56 
Tyr Glu Leu Arg Gly Tyr He Glu Asp Ser Gin Asp Leu Glu Val Tyr 
740 745 750 

ttg ate cgt tac aat gea aaa cac gaa acg tta aat gtg cca ggt acg 2304 
Leu He Arg Tyr Asn Ala Lys His Glu Thr Leu Asn Val Pro Gly Thr 
755 760 765 

ggt tec tta tgg cea ett gea gee gaa agt tea ate ggg agg tgc ggc 2352 
Gly Ser Leu Trp Pro Leu Ala Ala Glu Ser Ser He Gly Arg Cys Gly 
770 775 780 



gaa ecg aat cga tgc gcg cca eat att gaa tgg aat cet gaa eta gat 
Glu Pro Asn Arg Cys Ala Pro His He Glu Trp Asn Pro Glu Leu Asp 
785 790 795 800 



2400 



tgt teg tgt agg gat gga gaa aaa tgt gea cat cat tct cat eat tte 2448 
Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe 
805 810 815 



2496 



tec ttg gat att gat gtt gga tgt aca gac tta aat gag gat tta ggt 
Ser Leu Asp He Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly 
820 825 830 

gta tgg gtg ata ttt aag att aag acg caa gat ggc tat gea aga eta 2544 
Vai Trp Val He Phe Lys He Lys Thr Gin Asp Gly Tyr Ala Arg Leu 
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B35 840 845 

gga aat tta gag ttt etc gaa gag aaa cca ttg tta gga gaa gcg eta 2592 
Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu 
850 855 860 

get cgt gtg aag aga gcg gag aaa aaa tgg aga gac aaa cgc gac aaa 2640 
Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Asp Lys 
865 870 875 880 



ttg gaa tgg gaa aca aat att gtt tat aaa gag gca aaa gaa tct gta 
Leu Glu Trp Glu Thr Asn lie Val Tyr Lys Glu Ala Lys Glu Ser Val 
885 890 895 



aac att gcg atg att cat gtg gca gat aaa cgc gtt cat cga ate cga 
Asn He Ala Met He His Val Ala Asp Lys Arg Val His Arg He Arg 
915 920 925 



att ttc gaa gaa tta gaa ggt ctt att ttc act gca ttc tec eta tat 
He Phe Glu Glu Leu Glu Gly Leu He Phe Thr Ala Phe Ser Leu Tyr 
945 950 955 960 

gat gcg aga aat gtc att aaa aac gga gat ttc aat cat ggt tta tea 
Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe Asn His Gly Leu Ser 
965 970 975 



cgt teg gtc ctt gtt gtt ceg gaa tgg gaa gca gaa gtg tea caa gaa 
Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val Ser Gin Glu 
995 1000 1005 



2688 



gat get tta ttc gta gat tct caa tat aat aga tta caa acg gat aeg 2736 
Asp Ala Leu Phe Val Asp Ser Gin Tyr Asn Arg Leu Gin Thr Asp Thr 
900 905 910 



2784 



gaa gcg tat ttg cca gag tta tct gtg att ccg ggt gtc aat gcg get 2832 
Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly Val Asn Ala Ala 
25 930 935 940 



2880 



2928 



35 tge tgg aac gtg aaa ggg cat gta gat gta gaa gaa caa aat aac cac 2976 
Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu Gin Asn Asn His 
980 985 990 



3024 



3072 



gtc cgc gta tgt cca gga cgt ggc tat ate ctg cgt gtt aca gcg tac 

Val Arg Val Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr 
45 1010 1015 1020 

aaa gag ggc tac gga gaa gga tge gta acg ate cat gaa att gaa gat 3120 

Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr He His Glu He Glu Asp 
1025 1030 1035 1040 



cat aca gac gaa ctg aaa ttt aga aac tgt gaa gaa gag gaa gtg tat 3168 
His Thr Asp Glu Leu Lys Phe Arg Asn Cys Glu Glu Glu Glu Val Tyr 
1045 1050 1055 

ccg aat aac aeg gta acg tgt aat gat tat cca gca aat caa gaa gaa 3216 
Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Pro Ala Asn Gin Glu Glu 
1060 1065 1070 



wo 01/19859 



PCT/USOO/25361 



20 



IS 



20 



tac agg get gcg gaa act tec cgt aat cgt gga tat ggc gaa tct tat 
Tyr Arg Ala Ala Glu Thr Ser Arg Asn Arg Gly Tyr Gly Glu Ser Tyr 
1075 1080 



3264 



gaa agt aat tct tec ata cca get gag tat gcg cca att tat gag aaa 3312 
Glu ser Asn Ser Ser He Pro Ala Glu Tyr Ala Pro He Tyr Glu Lys 
1090 1095 1100 

10 gca tat aca gat gga aga aaa gag aat tct tgt gaa tct aac aga gga 
Ala Tyr Thr Asp Gly Arg Lys Glu Asn Ser Cys Glu Ser Asn Arg Gly 
1105 1110 1115 1120 



33G0 



tat gga aat tac aca ccg tta cca gca ggt tat gtg aca aaa gaa tta 3408 
Tyr Gly Asn Tyr Thr Pro Leu Pro Ala Gly Tyr Val Thr Lys Glu Leu 
1125 1130 1135 

qaq tac ttc cca gaa acc gat aag gta tgg ata gag att gga gaa acg 
Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp He Glu He Gly Glu Thr 
1140 1145 1150 

gaa gga aca ttc ate gta gac agt gtg gaa tta etc etc atg gag gaa 
Glu Gly Thr Phe He Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
1155 1160 1165 

^5 3507 
tag 



3456 



3504 



30 <210> 22 

<211> 1168 
<212> PRT 

<213> Bacillus thuringiensis 
35 <400> 22 



45 



55 



Met 


Glu 


He 


Asn 


Asn 
5 


Gin 


Asn 


Gin 


1 

Asn 


Pro 


Glu 


Glu 


Val 


Phe 


Leu 


Asp 








20 










Asp 


Pro 


Leu 


Glu 


Val 


Ser 


Leu 


Ser 




35 










40 


Phe 


Val 


Pro 


Gly 


Gly 


Gly 


Phe 


He 




50 










55 




Gly Ala 


Leu 


Arg 


Pro 


Ser 


Asp 


Trp 


65 










70 






Gin 


Leu 


He 


Asp 


Arg 


Arg 


He 


Glu 










85 








Ala 


Glu 


Leu 


Glu 


Gly 


Leu 


Gly Arg 








100 










Phe 


Lys 


Glu 


Trp 


Glu 


Lys 


Thr 


Pro 




115 










120 


Val 


Thr 


Glu 


Arg 


Phe 


Arg 


He 


He 




130 










135 




Pro 


Ser 


Phe 


Arg 


Val 


Ser 


Gly Phe 


145 










150 






Thr 


Gin 


Ala 


Ala 


Asn 


Leu 


His 


Leu 



165 



Cys 


He 


Pro 


Tyr 


Asn 


Cys 


Leu 


Ser 


10 










15 




Gly Glu 


Arg 


He 


Leu 


Pro 


Asp 


He 


25 










30 






Leu 


Leu 


Gin 


Phe 


Leu 
45 


Leu 


Asn 


Asn 


Ser 


Gly 


Leu 


Leu 
60 


Asp 


Lys 


He 


Trp 


Glu 


Leu 


Phe 
75 


Leu 


Ala 


Gin 


He 


Glu 
80 


Arg 


Thr 


Val 


Arg 


Ala 


Lys 


Ala 


He 


90 










95 




Ser 


Tyr 


Gin 


Leu 


Tyr 


Gly Glu 


Ala 


105 










110 






Asp 


Asn 


Thr 


Ala 


Ala 


Arg 


Ser Arg 








125 








Asp 


Ala 


Gin 


He 
140 


Glu 


Ala 


Asn 


He 


Glu 


Val 


Pro 
155 


Leu 


Leu 


Leu 


Val 


Tyr 
160 


Ala 


Leu 


Leu Arg 


Asp 


Ser 


Val 


Val 




170 










175 
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Phe Gly Glu Arg Trp Gly Leu Thr Thr Thr Asn Val Asn Asp He Tyr 

180 18S 190 

Asn Arg Gin Val Asn Arg lie Gly Glu Tyr Ser Lys His Cys Val Asp 
195 200 205 

5 Thr Tyr Lys Thr Glu Leu Glu Arg Leu Gly Phe Arg Ser He Ala Gin 
210 215 220 

Trn Arg He Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val Leu 
225 230 235 240 

Asp He Val Ala Val Phe Pro Asn Tyr Asp Ser Arg Leu Tyr Pro He 
,0 245 250 255 

Arq Thr He Ser Gin Leu Thr Arg Glu He Tyr Thr Ser Pro Val Ser 

260 265 270 

Glu Phe Tyr Tyr Gly Val He Asn Ser Asn Asn He He Gly Thr Leu 
275 280 285 

15 Thr Glu Gin Gin He Arg Arg Pro His Leu Met Asp Phe Phe Asn Ser 
290 295 300 

Met He Met Tyr Thr Ser Asp Asn Arg Arg Glu His Tyr Trp Ser Gly 
305 310 315 320 

Leu Glu Met Thr Ala Thr Asn Thr Glu Gly His Gin Arg Ser Phe Pro 
20 325 330 335 

Leu Ala Gly Thr He Gly Asn Ser Ala Pro Pro Val Thr Val Arg Asn 

340 345 350 

Asn Gly Glu Gly He Tyr Arg He Leu Ser Glu Pro Phe Tyr Ser Ala 
355 360 365 

25 Pro Phe Leu Gly Thr Ser Val Leu Gly Ser Arg Gly Glu Glu Phe Ala 
370 375 380 

Phe Ala Ser Asn Thr Thr Thr Ser Leu Pro Ser Thr He Tyr Arg Asn 
385 390 395 400 

Arg Gly Thr Val Asp Ser Leu Val Ser He Pro Pro Gin Asp Tyr Ser 
30 405 410 415 

Val Pro Pro His Arg Gly Tyr Ser His Leu Leu Ser His Val Thr Met 

420 425 430 

Arg Asn Ser Ser Pro He Phe His Trp Thr His Arg Ser Ala Thr Pro 
435 440 445 

35 Arg Asn Thr He Asp Pro Asp Ser He Thr Gin He Pro Ala Val Lys 
450 455 460 

Glv Ala Tyr He Phe Asn Ser Pro Val He Thr Gly Pro Gly His Thr 
465 470 475 480 

Glv Gly Asp He He Arg Phe Asn Pro Asn Thr Gin Asn Asn He Arg 
40 485 490 495 

He Pro Phe Gin Ser Asn Ala Val Gin Arg Tyr Arg He Arg Met Arg 

500 505 510 

Tyr Ala Ala Glu Ala Asp Cys He Leu Glu Ser Gly Val Asn He Val 
515 520 525 

45 Thr Gly Ala Gly Val Thr Phe Arg Pro He Pro He Lys Ala Thr Met 
530 535 540 

Thr Pro Gly Ser Pro Leu Thr Tyr Tyr Ser Phe Gin Tyr Ala Asp Leu 
545 550 555 560 

Asn He Asn Leu Thr Ala Pro He Arg Pro Asn Asn Phe Val Ser He 
50 565 570 575 

Arg Arg Ser Asn Gin Pro Gly Asn Leu Tyr He Asp Arg He Glu Phe 

580 585 590 

He Pro He Asp Pro He Arg Glu Ala Glu His Asp Leu Glu Arg Ala 
595 600 605 

55 Gin Lys Ala Val Asn Ala Leu Phe Thr Ser Ser Asn Gin Leu Gly Leu 
610 615 620 

Lys Thr Asp Val Thr Asp Tyr His He Asp Gin Val Ser Asn Leu Val 
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625 630 635 640 

Ala Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Ser 

645 650 655 

Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu 
5 660 S65 670 

Gin Asp Gin Asn Phe Thr Gly He Asn Arg Gin Val Asp Arg Gly Trp 

675 680 685 

Arg Gly Ser Thr Asp He Thr He Gin Gly Gly Asn Asp Val Phe Lys 
690 695 700 

10 Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr Pro Thr 
705 710 715 720 

Tyr Leu Tyr Gin Lys He Asp Glu Ser Lys Leu Lys Pro Tyr Thr Arg 

725 730 735 

Tyr Glu Leu Arg Gly Tyr He Glu Asp Ser Gin Asp Leu Glu Val Tyr 
15 740 745 750 

Leu He Arg Tyr Asn Ala Lys His Glu Thr Leu Asn Val Pro Gly Thr 

755 760 765 

Gly Ser Leu Trp Pro Leu Ala Ala Glu Ser Ser He Gly Arg Cys Gly 
770 775 780 

20 Glu Pro Asn Arg Cys Ala Pro His He Glu Trp Asn Pro Glu Leu Asp 
785 790 795 800 

Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe 

805 810 815 

Ser Leu Asp He Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu Gly 
25 8 2 0 8 2 5 8 3 0 

val Trp Val He Phe Lys He Lys Thr Gin Asp Gly Tyr Ala Arg Leu 

835 840 845 

Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu 
850 855 860 

30 Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Asp Lys 
865 870 875 880 

Leu Glu Trp Glu Thr Asn He Val Tyr Lys Glu Ala Lys Glu Ser Val 

885 890 895 

Asp Ala Leu Phe Val Asp Ser Gin Tyr Asn Arg Leu Gin Thr Asp Thr 
35 900 905 910 

Asn He Ala Met He His Val Ala Asp Lys Arg Val His Arg He Rrg 

915 920 925 

Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly Val Asn Ala Ala 
930 935 940 

40 He Phe Glu Glu Leu Glu Gly Leu He Phe Thr Ala Phe Ser Leu Tyr 
945 950 955 960 

Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe Asn His Gly Leu Ser 

965 970 975 

Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu Gin Asn Asn His 
45 980 985 990 

Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val Ser Gin Glu 

995 1000 1005 

Val Arg Val Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr 
1010 1015 1020 

SO Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr He His Glu He Glu Asp 
1025 1030 1035 1040 

His Thr Asp Glu Leu Lys Phe Arg Asn Cys Glu Glu Glu Glu Val Tyr 

1045 1050 1055 

Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Pro Ala Asn Gin Glu Glu 
55 1060 1065 1070 

Tyr Arg Ala Ala Glu Thr Ser Arg Asn Arg Gly Tyr Gly Glu Ser Tyr 
1075 1080 1085 
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Glu Ser Asn Ser Ser lie Pro Ala Glu Tyr Ala Pro lie Tyr Glu Lys 

1090 1095 1100 

Ala Tyr Thr Asp Gly Arg Lys Glu Asn Ser Cys Glu Ser Asn Arg Gly 
1105 1110 1115 1120 

5 Tyr Gly Asn Tyr Thr Pro Leu Pro Ala Gly Tyr Val Thr Lys Glu Leu 
1125 1130 1135 

Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp He Glu He Gly Glu Thr 

1140 1145 1150 

Glu Gly Thr Phe He Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
10 1155 1160 1165 



<210> 23 

15 <211> 348 

<212> DNA I 

<213> Bacillus thuringiensis 

<400> 23 

20 aataatagag gtcatcttcc aattccaatc caattttctt cgcgttctac cagatatcga 60 

gttcgtgtac gttatgcttc tgcaaccccc attcaagtca atgttcattg ggaaaatagc 120 

tcgttttttt caggtacagt accagctacg gctcagtcat tagataatct acaatcaaac 180 

aattttggtt actttgagac cgctaatact atttcatctt cattagatgg tatagtaggt 240 

attagaaatt ttagtgcaaa tgcagatttg ataatagaca gatttgaatt tatcccagtg 300 

25 gatgcaacct ccgaggcaga acatgattta gaaagagcgc aaaaggcg 348 



<210> 24 
<211> 116 
30 <212> PRT 

<213> Bacillus thuringiensis 

<400> 24 

Asn Asn Arg Gly His Leu Pro He Pro He Gin Phe Ser Ser Arg Ser 
35 1 5 10 15 

Thr Arg Tyr Arg Val Arg Val Arg Tyr Ala Ser Ala Thr Pro He Gin 
20 25 30 

Val Asn Val His Trp Glu Asn Ser Ser Phe Phe Ser Gly Thr Val Pro 
35 40 45 

Ala Thr Ala Gin Ser Leu Asp Asn Leu Gin Ser Asn Asn Phe Gly Tyr 
50 55 60 

Phe Glu Thr Ala Asn Thr He Ser Ser Ser Leu Asp Gly He Val Gly 
65 70 75 80 

He Arg Asn Phe Ser Ala Asn Ala Asp Leu He He Asp Arg Phe Glu 
85 90 95 

Phe He Pro Val Asp Ala Thr Ser Glu Ala Glu His Asp Leu Glu Arg 
100 105 110 

55 Ala Gin Lys Ala 
115 



40 



45 



SO 
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<210> 25 
<211> 186 
<212> DNA 
S <213> Bacillus thuringiensis 

<400> 25 

ccactaacat ctcgttcgtt cgctcataca acactcttca ctccaataac cttttcacga 60 
gctcaagaag aatttgatct atacatccaa tcgggtgttt atatagatcg aattgaattt 120 
10 attccagtta ctgcaacatt tgaggcagaa tatgatttag aaagagcgca aagggcggtg 180 
aatgcc 



<2io> 26 
15 <211> 62 
<212> PRT 

<213> Bacillus thuringiensis 
<400> 2& 

20 Pro Leu Thr Ser Arg Ser Phe Ala His Thr Thr Leu Phe Thr Pro lie 
1 5 10 15 

Thr Phe Ser Arg Ala Gin Glu Glu Phe Asp Leu Tyr lie Gin Ser Gly 
20 25 30 



25 



Val Tyr lie Asp Arg lie Glu Phe He Pro Val Thr Ala Thr Phe Glu 
35 40 45 



Ala Glu Tyr Asp Leu Glu Arg Ala Gin Arg Ala Val Asn Ala 
30 50 55 60 



<210> 27 
<211> 3471 
35 <212> DNA 

<213> Bacillus thuringiensis f^t^ 
<400> 27 

atgaatcgaa ataatcaaaa tgaatatgaa attattgatg ccccccattg tgggtgtcca 60 

40 tcagatgacg atgtgaggta tcctttggca agtgacccaa atgcagcgtt acaaaatatg 120 

aactataaag attacttaca aatgacagat gaggactaca ctgattctta tataaatcct 180 

agtttatcta ttagtggtag agatgcagtt cagactgcgc ttactgttgt tgggagaata 24 0 

ctcggggctt taggtgttcc gttttctgga caaatagtga gtttttatca attcctttta 300 

aatacactgt ggccagttaa tgatacagct atatgggaag ctttcatgcg acaggtggag 360 

45 gaacttgtca atcaacaaat aacagaattt gcaagaaatc aggcacttgc aagattgcaa 420 

ggattaggag actcttttaa tgtatatcaa cgttcccttc aaaattggtt ggctgatcga 480 

aatgatacac gaaatttaag tgttgttcgt gctcaattta tagctttaga ccttgatttt 540 

gttaatgcta ttccattgtt tgcagtaaat ggacagcagg ttccattact gtcagtatat 600 

gcacaagctg tgaatttaca tttgttatta ttaaaagatg catctctttt tggagaagga 660 

50 tggggattca cacaggggga aatttccaca tattatgacc gtcaattgga actaaccgct 720 

aagtacacta attactgtga aacttggtat aatacaggtt tagatcgttt aagaggaaca 780 

aatactgaaa gttggttaag atatcatcaa ttccgtagag aaatgacttt agtggtatta 840 

gatgttgtgg cgctatttcc atattatgat gtacgacttt atccaacggg atcaaaccca 900 

cagcttacac gtgaggtata tacagatccg attgtattta atccaccagc taatgttgga 960 

55 ctttgccgac gttggggtac taatccctat aatacttttt ctgagctcga aaatgccttc 1020 

attcgcccac cacatctttt tgataggctg aatagcttaa caatcagcag taatcgattt 1080 

ccagtttcat ctaattttat ggattattgg tcaggacata cgttacgccg tagttatctg 1140 
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aacgattcag cagtacaaga agatagttat ggcctaatta caaccacaag agcaacaatt 1200 
aatcctggag ttgatggaac aaaccgcata gagtcaacgg cagtagattt tcgttctgca 1260 
ttgataggta tatatggcgt gaatagagct tcttttgtcc caggaggctt gtttaatggt 1320 
acgacttctc ctgctaatgg aggatgtaga gatctctatg atacaaatga tgaattacca 1380 
5 ccagatgaaa gtaccggaag ttctacccat agactatctc atgttacctt ttttagtttt 1440 
caaactaatc aggctggatc tatagctaat gcaggaagtg tacctactta tgtttggacc 1500 
cgtcgtgatg tggaccttaa taatacgatt accccaaata gaattacaca attaccattg 1560 
gtaaaggcat ctgcacctgt ttcgggtact acggtcttaa aaggtccagg atttacagga 1620 
gggggtatac tccgaagaac aactaatggc acatttggaa cgttaagagt aacagttaat 1680 

10 tcaccattaa cacaaagata tcgcgtaaga gttcgttttg cttcatcagg aaatttcagc 1740 
ataaggatac tgcgtggaaa tacctctata gcttatcaaa gatttgggag tacaatgaac 1800 
agaggacagg aactaactta cgaatcattt gtcacaagtg agttcactac taatcagagc 1860 
gatctgcctt ttacatttac acaagctcaa gaaaatttaa caatccttgc agaaggtgtt 1920 
agcaccggta gtgaatattt tatagataga attgaaatca tccctgtgaa cccggcacga 1980 

15 gaagcagaag aggatttaga agcagcgaag aaagcggtgg cgaacttgtt tacacgtaca 2040 
\ agggacggat tacaggtaaa tgtgacagat tatcaagtgg accaagcggc aaatttagtg 2100 

tcatgcttat ccgatgaaca atatgggcat gacaaaaaga tgttattgga agcggtaaga 2160 
gcggcaaaac gcctcagccg cgaacgcaac ttacttcaag atccagattt taatacaatc 2220 
aatagtacag aagagaatgg ctggaaggca agtaacggtg ttactattag cgagggcggt 2280 

20 ccattcttta aaggtcgtgc acttcagtta gcaagcgcaa gagaaaatta tccaacatac 2340 
atttatcaaa aagtagatgc atcggtgtta aagccttata cacgctatag actagatgga 2400 
tttgtgaaga gtagtcaaga tttagaaatt gatctcatcc accatcataa agtccatctt 2460 
gCaaaaaatg taccagataa tttagtatct gatacttact cagatggttc ttgcagcgga 2520 
atcaaccgtt gtgatgaaca gcatcaggta gatatgcagc tagatgcgga gcatcatcca 2580 

25 atggattgct gtgaagcggc tcaaacacat gagttttctt cctatattaa tacaggggat 2640 
ctaaatgcaa gtgtagatca gggcatttgg gttgtattaa aagttcgaac aacagatggg 2700 
tatgcgacgt taggaaatct tgaattggta gaggttgggc cattatcggg tgaatctcta 2760 
gaacgggaac aaagagataa tgcgaaatgg aatgcagagc taggaagaaa acgtgcagaa 2820 
atagatcgtg tgtatttagc tgcgaaacaa gcaattaatc atctgtttgt agactatcaa 2880 

30 gatcaacaat taaatccaga aattgggcta gcagaaatta atgaagcttc aaatcttgta 2940 
gagtcaattt cgggtgtata tagtgataca ctattacaga ttcctgggat taactacgaa 3000 
atttacacag agttatccga tcgcttacaa caagcatcgt atctgtatac gtctagaaat 3060 
gcggtgcaaa atggagactt taacagtggt ctagatagtt ggaatacaac tatggatgca 3120 
tcggttcagc aagatggcaa tatgcatttc ttagttcttt cgcattggga tgcacaagtt 3180 

35 tcccaacaat tgagagtaaa tccgaattgt aagtatgtct tacgtgtgac agcaagaaaa 3240 
gtaggaggcg gagatggata cgtcacaatc cgagatggcg ctcatcacca agaaactctt 3300 
acatttaatg catgtgacta cgatgtaaat ggtacgtatg tcaatgacaa ttcgtatata 3360 
acagaagaag tggtattcta cccagagaca aaacatatgt gggtagaggt gagtgaatcc 3420 
gaaggttcat tctatataga cagtattgag tttattgaaa cacaagagta g 3471 

40 

<210> 28 
<211> 11S6 
<212> PRT 
45 <213> Bacillus thuringiensis 

<400> 28 

Met Asn Arg Asn Asn Gin Asn Glu Tyx Glu He He Asp Ala Pro His 
15 10 IS 

50 

Cys Gly Cys Pro Ser Asp Asp Asp Val Arg Tyr Pro Leu Ala Ser Asp 
20 25 30 

Pro Asn Ala Ala Leu Gin Asn Met Asn Tyr Lys Asp Tyr Leu Gin Met 
55 35 40 45 

Thr Asp Glu Asp Tyr Thr Asp Ser Tyr He Asn Pro Ser Leu Ser He 
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55 60 



. ser Gly Arg Asp Ala Val Gin Thr Ala Leu Thr Val Val Gly Arg lie 
65 70 75 80 

^ Leu Gly Ala Leu Gly Val Pro Phe Ser Gly Gin He Val Ser Phe Tyr 
85 90 95 

Gin Phe Leu Leu Asn Thr Leu Trp Pro Val Asn Asp Thr Ala He Trp 
,0 100 105 110 

Glu Ala Phe Met Arg Gin Val Glu Glu Leu Val Asn Gin Gin He Thr 
115 120 125 

IS Glu Phe Ala Arg Asn Gin Ala Leu Ala Arg Leu Gin Gly Leu Gly Asp 
130 135 140 

Ser Phe Asn Val Tyr Gin Arg Ser Leu Gin Asn Trp Leu Ala Asp Arg 
145 150 155 160 

20 

Asn ASP Thr Arg Asn Leu. Ser Val Val Arg Ala Gin Phe He Ala Leu 
165 170 175 

ASP Leu ASP Phe Val Asn Ala He Pro Leu Phe Ala Val Asn Gly Gin 
25 180 185 190 

Gin Val Pro Leu Leu Ser Val Tyr Ala Gin Ala Val Asn Leu His Leu 
195 200 205 

30 Leu Leu Leu Lys Asp Ala Ser Leu Phe Gly Glu Gly Trp Gly Phe Thr 
210 215 220 

Gin Gly Glu He Ser Thr Tyr Tyr Asp Arg Gin Leu Glu Leu Thr Ala 
225 230 235 240 



35 



Lvs Tyr Thr Asn Tyr Cys Glu Thr Trp Tyr Asn Thr Gly Leu Asp Arg 
245 250 255 



Leu Arq Gly Thr Asn Thr Glu Ser Trp Leu Arg Tyr His Gin Phe Arg 
40 260 265 270 

Arg Glu Met Thr Leu Val Val Leu Asp Val Val Ala Leu Phe Pro Tyr 
275 280 285 

45 Tyr Asp val Arg Leu Tyr Pro Thr Gly Ser Asn Pro Gin Leu Thr Arg 
290 295 300 

Glu val Tyr Thr Asp Pro He Val Phe Asn Pro Pro Ala Asn Val Gly 
305 , 310 315 1 . 320 

Leu Cys Arg Arg Trp Gly Thr Asn Pro Tyr Asn Thr Phe Ser Glu Leu 

325 330 335 



ss 



Glu Asn Ala Phe He Arg Pro Pro His Leu Phe Asp Arg Leu Asn Ser 
340 345 350 



Leu 



Thr He Ser Ser Asn Arg Phe Pro Val Ser Ser Asn Phe Met Asp 
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355 360 365 

Tyr Trp Ser Gly His Thr Leu Arg Arg Ser Tyr Leu Asn Asp Ser Ala 
370 375 380 

S 

Val Gin Glu Asp Ser Tyr Gly Leu lie Thr Thr Thr Arg Ala Thr He 
385 390 395 400 

Asn Pro Gly Val Asp Gly Thr Asn Arg He Glu Ser Thr Ala Val Asp 
10 405 410 415 

Phe Arg Ser Ala Leu He Gly He Tyr Gly Val Asn Arg Ala Ser Phe 
420 425 430 

15 Val Pro Gly Gly Leu Phe Asn Gly Thr Thr Ser Pro Ala Asn Gly Gly 
435 440 445 

Cys Arg Asp Leu Tyr Asp Thr Asn Asp Glu Leu Pro Pro Asp Glu Ser 
450 455 460 

20 

Thr Gly Ser Ser Thr His Arg Leu Ser His Val Thr Phe Phe Ser Phe 
465 470 475 480 

Gin Thr Asn Gin Ala Gly Ser He Ala Asn Ala . Gly Ser Val Pro Thr 
25 485 490 495 

Tyr Val Trp Thr Arg Arg Asp Val Asp Leu Asn Asn Thr He Thr Pro 
500 505 510 

30 Asn Arg He Thr Gin Leu Pro Leu Val Lys Ala Ser Ala Pro Val Ser 
5X5 520 525 

Gly Thr Thr Val Leu Lys Gly Pro Gly Phe Thr Gly Gly Gly He Leu 
530 535 540 

35 

Arg Arg Thr Thr Asn Gly Thr Phe Gly Thr Leu Arg Val Thr Val Asn 
545 550 555 560 

Ser Pro Leu Thr Gin Arg Tyr Arg Val Arg Val Arg Phe Ala Ser Ser 
40 565 570 575 

Gly Asn Phe Ser He Arg He Leu Arg Gly Asn Thr Ser He Ala Tyr 
580 585 590 

45 Gin Arg Phe Gly Ser Thr Met Asn Arg Gly Gin Glu Leu Thr Tyr Glu 
595 600 605 

Ser Phe Val Thr Ser Glu Phe Thr Thr Asn Gin Ser Asp Leu Pro Phe 

610 615 620 



50 



Thr Phe Thr Gin Ala Gin Glu Asn Leu Thr He Leu Ala Glu Gly Val 
625 630 635 640 



Ser Thr Gly Ser Glu Tyr Phe He Asp Arg He Glu He He Pro Val 
55 645 650 655 

Asn Pro Ala Arg Glu Ala Glu Glu Asp Leu Glu Ala Ala Lys Lys Ala 
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660 665 670 

Val Ala Asn Leu Phe Thr Arg Thr Arg Asp Gly Leu Gin Val Asn Val 
675 680 685 

S 

Thr Asp Tyr Gin Val Asp Gin Ala Ala Asn Leu Val Ser Cys Leu Ser 
690 695 700 

Asp Glu Gin Tyr Gly His Asp Lys Lys Met Leu Leu Glu Ala Val Arg 
10 705 710 715 720 

Ala Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gin Asp Pro Asp 
725 730 735 

15 Phe Asn Thr lie Asn Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser Asn 
740 745 750 

Gly Val Thr lie Ser Glu Gly Gly Pro Phe Phe Lys Gly Arg Ala Leu 
755 760 765 

20 

Gin Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr He Tyr Gin Lys 
770 775 780 

Val Asp Ala Ser Val Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp Gly 
25 785 790 795 800 

Phe Val Lys Ser Ser Gin Asp Leu Glu He Asp Leu He His His His 
805 810 815 

30 Lys Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val Ser Asp Thr 
820 825 830 

Tyr Ser Asp Gly Ser Cys Ser Gly He Asn Arg Cys Asp Glu Gin His 
835 840 845 

35 

Gin Val Asp Met Gin Leu Asp Ala Glu His His Pro Met Asp Cys Cys 
850 855 860 

Glu Ala Ala Gin Thr His Glu Phe Ser Ser Tyr He Asn Thr Gly Asp 
40 865 870 875 880 

Leu Asn Ala Ser Val Asp Gin Gly He Trp Val Val Leu Lys Val Arg 
885 890 895 

45 Thr Thr Asp Gly Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu Val 

900 905 910 

Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gin Arg Asp Asn Ala 
915 920 925 

50 

Lys Trp Asn Ala Glu Leu Gly Arg Lys Arg Ala Glu He Asp Arg Val 
930 935 940 

Tyr Leu Ala Ala Lys Gin Ala He Asn His Leu Phe Val Asp Tyr Gin 
55 945 950 955 960 

Asp Gin Gin Leu Asn Pro Glu He Gly Leu Ala Glu He Asn Glu Ala 
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965 970 975 

Ser Asn Leu Val Glu Ser He Ser 61y Val Tyr Ser Asp Thr Leu Leu 
980 985 990 

S 

Qln He Pro Gly He Asn Tyr Glu He Tyr Thr Glu Xjeu Ser Asp Arg 
995 1000 1005 

Leu Gin Gin Ala Ser Tyr Leu Tyr Thr Ser Arg Asn Ala Val Gin Asn 
10 1010 1015 1020 

Gly Asp Phe Asn Ser Gly Leu Asp Ser Trp Asn Thr Thr Met TVsp Ala 
1025 1030 1035 1040 

15 Ser Val Gin Gin Asp Gly Asn MeC His Phe Leu Val Leu Ser His Trp 
1045 1050 ,1055 

Asp Ala Gin Val Ser Gin Gin Leu Arg Val Asn Pro Asn Cys Lys Tyr 
1060 1065 1070 



20 



35 



Val Leu Arg Val Thr Ala Arg Lys Val Gly Gly Gly Asp Gly Tyr Val 
1075 1080 1085 



Thr He Arg Asp Gly Ala His His Gin Glu Thr l.eu Thr Phe Asn Ala 
2S 1090 1095 1100 

Cys Asp Tyr Asp Val Asn Gly Thr Tyr Val Asn Asp Aisn Ser Tyr He 
1105 1110 1115 1120 

30 Thr Glu Glu Val Val Phe Tyr Pro Glu Thr Lys His Met Trp Val Glu 
1125 1130 1135 

Val Ser Glu Ser Glu Gly Ser Phe Tyr He Asp Ser He Glu Phe He 
1140 1145 1150 



Glu Thr Gin Glu 
1155 



40 <210> 29 

<211> 2407 
<212> DNA 

<213> Bacillus thuringiensis 

45 <400> 29 

atgaatcaaa ataaacacgg aattattggc 
gttgcgaaat atcctttagc caacaatcca 
caaaatagta gtattctcaa ctggattaac 
tctattggga caaccatagt ctctcttatc 

50 atagtatatg accttatagg taaagtacta 
ttgtctatat gtgacttatt atctattatt 
gatgggattg cagattttaa tggttctgta 
gatagctgga ataagaatcc taattctgct 
atcgccgact cagaatttga tagaatttta 

SS ttagctagac aaaatgccca aatattatta 
catttattac tactaaggga tgctactaga 
acacctttta taaattatca atcaaaacta 



gcttccaatt gtggttgtgc atctgatgat 60 
tattcatctg ctttaaattt aaattcttgt 120 
ataataggcg atgcagcaaa agaagcagta 18 0 
acagcacctt ctcttactgg attaatttca 24 0 
ggaggtagta gtggacaatc catatcagat 300 
gatttacggg taagtcagag tgttttaaat 360 
ctcttataca ggaactattt agaggctctg 420 
tctgctgaag aactccgtac tcgttttaga 480 
acccgagggt ctttaacgaa tggtggctcg 540 
ttaccttctt ttgcgagcgc tgcatttttc 600 
tatggcacta attgggggct atacaatgct 660 
gtagagctta ttgaactata tactgattat 720 
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lo 



IS 



20 



25 



30 



tgcgtacatt gggataatcg aggttcaacc gaactaagac aacgagggcc tagtgctaca 780 
gcttggttag aatttcatag atatcggaga gagatgacat tgatgggatt agaaatagta 840 
gcatcatttt caagtcttga tattactaat tacccaatag aaacagattt tcagttgagt 900 
agggtcattt atacagatcc aattggtttt gtacatcgta gtagtcttag gggagaaagt 960 
tggtttagct ttgttaatag agctaatttc tcagatttag aaaatgcaat acctaatcct 1020 
agaccgtctt ggtttttaaa taatatgatt atatctactg gttcacttac attgccggtt 1080 
agcccaagta ctgatagagc gagggtatgg tatggaagtc gagatcgaat ttcccctgct 1140 
aattcacaat ttattactga actaatctct ggacaacata cgactgctac acaaactatt 1200 
ttagggcgaa atatatttag agtagattct caagcttgta atttaaatga taccacatat 1260 
ggagtgaata gggcggtatt tcatcatgat gcgagtgaag gttctcaaag atccgtgtac 1320 
gaqqggtata ttcgaacaac tgggatagat aaccctagag ttcaaaatat taacacttat 1380 
ttacctggag aaaattcaga tatcccaact ccagaagact atactcatat attaagcaca 1440 
acaataaatt taacaggagg acttagacaa gtagcatcta atcgccgttc atctttagta 1500 
atgtatggtt ggacacataa aagtctggct cgtaacaata ccattaatcc agatagaatt 1560 
acacagatac cattgacgaa ggttgatacc cgaggcacag gtgtttctta tgtgaatgat 1620 
ccaggattta taggaggagc tctacttcaa aggactgacc atggttcgct tggagtattg 1680 
agggtccaat ttccacttca cttaagacaa caatatcgta ttagagtccg ttatgcttct 1740 
acaacaaata ttcgattgag tgtgaatggc agtttcggta ctatttctca aaatctccct 1800 
agtacaatga gattaggaga ggatttaaga tacggatctt ttgctataag agagtttaat 18G0 
acttctatta gacccactgc aagtccggac caaattcgat tgacaataga accatctttt 1920 
attagacaag aggtctatgt agatagaatt gagttcattc cagttaatcc gacgcgagag 1980 
gcgaaagagg atctagaagc agcaaaaaaa gcggtggcga gcttgtttac acgcacaagg 2040 
gacggattac aagtaaatgt gaaagattat 'caagtcgatc aagcggcaaa tttagtgtca 2100 
tgcttatcag atgaacaata tgggtatgac aaaaagatgt tattggaagc ggtacgtgcg 2160 
gcaaaacgac ttagccgaga acgcaactta cttcaggatc cagattttaa tacaatcaat 2220 
agtacagaag aaaatggatg gaaagcaagt aacggcgtta ctattagtga gggcgggcca 2280 
ttctataaag gccgtgcaat tcagctagca agtgcacgag aaaattaccc aacatacatc 2340 
tatcaaaaag tagatgcatc ggagttaaag ccgtatacac gttatagact ggatgggttc 2400 
gtgaaga 



40 



55 



<210> 30 

<211> 802 

<212> PRT 

35 <213> Bacillus thuringiensis 

<400> 30 

Met Asn Gin Asn Lys His Gly He He Gly Ala Ser Asn Cys Gly Cys 
1 5 10 15 



Ala Ser Asp Asp Val Ala Lys Tyr Pro Leu Ala Asn Asn Pro Tyr Ser 
20 25 30 



Ser Ala Leu Asn Leu Asn Ser Cys Gin Asn Ser Ser He Leu Asn Trp 
45 35 40 45 

He Asn He He Gly Asp Ala Ala Lys Glu Ala Val Ser He Gly Thr 



50 



55 60 



50 Thr He Val Ser Leu He Thr Ala Pro Ser Leu Thr Gly Leu He Ser 
65 70 75 80 

He Val Tyr Asp Leu He Gly Lys Val Leu Gly Gly Ser Ser Gly Gin 



85 



90 95 



Ser He Ser Asp Leu Ser He Cys Asp Leu Leu Ser He He Asp Leu 
100 105 110 
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Arg Val Ser Gin Ser Val Leu Asn Asp Gly lie Ala Asp Phe Asn Gly 
115 120 125 

5 Ser Val Leu Leu Tyr Arg Asn Tyr Leu Glu Ala Leu Asp Ser Trp Asn 
130 135 140 

Lys Asn Pro Asn Ser Ala Ser Ala Glu Glu Leu Arg Thr Arg Phe Arg 
145 ISO 155 160 

10 

lie Ala Asp Ser Glu Phe Asp Arg lie Leu Thr Arg Gly Ser Leu Thr 
165 170 175 

Asn Gly Gly Ser Leu Ala Arg Gin Asn Ala Gin lie Leu Leu Leu Pro 
15 180 185 ' 190 

Ser Phe Ala Ser Ala Ala Phe Phe His Leu Leu Leu Leu Arg' Asp Ala 
195 200 205 

20 Thr Arg Tyr Gly Thr Asn Trp Gly Leu Tyr Asn Ala Thr Pro Phe lie 

210 215 220 

Asn Tyr Gin Ser Lys Leu Val Glu Leu lie Glu Leu Tyr Thr Asp Tyr 
225 230 235 240 

25 

Cys Val His Trp Asp Asn Arg Gly Ser Thr Glu Leu Arg Gin Arg Gly 
245 250 255 

Pro Ser Ala Thr Ala Trp Leu Glu Phe His Arg Tyr Arg Arg Glu Met 
30 260 265 270 

Thr Leu Met Gly Leu Glu lie Val Ala Ser Phe Ser Ser Leu Asp lie 
275 280 285 

35 Thr Asn Tyr Pro lie Glu Thr Asp Phe Gin Leu Ser Arg Val lie Tyr 
290 295 300 

Thr Asp Pro lie Gly Phe Val His Arg Ser Ser I-eu Arg Gly Glu Ser 
305 310 315 320 

40 

Trp Phe Ser Phe Val Asn Arg Ala Asn Phe Ser Asp Leu Glu Asn Ala 
325 330 335 

lie Pro Asn Pro Arg Pro Ser Trp Phe Leu Asn Asn Met lie lie Ser 
45 340 345 350 

Thr Gly Ser Leu Thr Leu Pro Val Ser Pro Ser Thr Asp Arg Ala Arg 
355 360 365 

I 

50 Val Trp Tyr Gly Ser Arg Asp Arg lie Ser Pro Ala Asn Ser Gin Phe 
370 375 380 

lie Thr Glu Leu lie Ser Gly Gin His Thr Thr Ala Thr Gin Thr lie 
385 390 395 400 

55 

Leu Gly Arg Asn lie Phe Arg Val Asp Ser Gin Ala Cys Asn Leu Asn 
405 410 415 
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Asp Thr Thr Tyr Gly Val Asn Arg Ala Val Phe Tyr His Asp Ala Ser 
420 425 430 

5 Glu Gly ser Gin Arg Ser Val Tyr Glu Gly Tyr He Arg Thr Thr Gly 
435 440 445 

He Asp Asn Pro Arg Val Gin Asn He Asn Thr Tyr Leu Pro Gly Glu 
450 455 460 

Asn ser Asp He Pro Thr Pro Glu Asp Tyr Thr His He Leu Ser Thr 



Thr He Asn Leu Thr Gly Gly Leu Arg Gin Val Ala Ser Asn Arg Arg 



15 



485 490 495 



Ser ser Leu Val Met Tyr Gly Trp Thr His Lys Ser Leu Ala Arg Asn 

500 505 510 

20 Asn Thr He Asn Pro Asp Arg He Thr Gin He Pro Leu Thr Lys Val 
515 520 525 

ASP Thr Arg Gly Thr Gly Val Ser Tyr Val Asn Asp Pro Gly Phe He 
530 535 540 

" Gly Gly Ala Leu Leu Gin Arg Thr Asp His Gly Ser Leu Gly Val Leu 
545 550 555 560 

Arg Val Gin Phe Pro Leu His Leu Arg Gin Gin Tyr Arg He Arg Val 
30 565 570 575 

Ara Tyr Ala Ser Thr Thr Asn He Arg Leu Ser Val Asn Gly Ser Phe 
580 585 590 

35 Gly Thr He Ser Gin Asn Leu Pro Ser Thr Met Arg Leu Gly Glu Asp 
595 600 «05 

Leu Arg Tyr Gly Ser Phe Ala He Arg Glu Phe Asn Thr Ser He Arg 
610 615 620 

'° pro Thr Ala Ser Pro Asp Gin He Arg Leu Thr He Glu Pro Ser Phe 
625 630 635 

He Arg Gin Glu Val Tyr Val Asp Arg He Glu Phe He Pro Val Asn 

45 645 650 655 

Pro Thr Arg Glu Ala Lys Glu Asp Leu Glu Ala Ala Lys Lys Ala Val 
660 665 670 

SO Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gin Val Asn Val Lys 
675 680 685 

ASD Tyr Gin Val Asp Gin Ala Ala Asn Leu Val Ser Cys Leu Ser Asp 
690 695 700 

" Glu Gin Tyr Gly Tyr Asp Lys Lys Met Leu Leu Glu Ala Val Arg Ala 
705 710 715 
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Ala Lys Arg Leu Ser Arg Glu Arg 
725 

5 Asn Thr lie Asn Ser Thr Glu Glu 
740 

Val Thr lie Ser Glu Gly Gly Pro 
755 760 

10 

Leu Ala Ser Ala Arg Glu Asn Tyr 
770 775 

Asp Ala Ser Glu Leu Lys Pro Tyr 
15 785 790 

Val Lys 



-33- 



Asn Leu Leu Gin Asp Pro Asp Phe 
730 735 

Asn Gly Trp Lys Ala Ser Asn Gly 
745 750 

Phe Tyr Lys Gly Arg Ala lie Gin 
765 

Pro Thr Tyr lie Tyr Gin Lys Val 
780 

Thr Arg Tyr Arg Leu Asp Gly Phe 
795 800 



20 

<210> 31 
<211> 192 
<212> DNA 

<213> Bacillus thuringiensis 

25 

<400> 31 

catttacgca acctcgtatg gatttcattt 
acttctcgtt cgtttgctct tacaactctt 
gaagaattta atctaacaat accacggggt 
30 gttatgccac at 



ccaagaacta tgggaacaga tgacccatta 60 
ttcacaccaa taaccttaac acgagcacaa 120 
gtttatatag acagaattga attcgtccca 180 

192 



<210> 32 
<211> 64 
35 <212> PRT 

<213> Bacillus thuringiensis 

<400> 32 

His Leu Arg Asn Leu Val Trp lie 
40 1 5 

Asp Asp Pro Leu Thr Ser Arg Ser 
20 

45 Pro lie Thr Leu Thr Arg Ala Gin 
35 40 

Arg Gly Val Tyr lie Asp Arg lie 

50 , 55 

SO 



Ser Phe Pro Arg Thr Met Gly Thr 
10 15 

Phe Ala Leu Thr Thr Leu Phe Thr 
25 30 

Glu Glu Phe Asn Leu Thr lie Pro 
45 

Glu Phe Val Pro Val Met Pro His 
60 



55 <210> 33 
<211> 246 
<212> DNA 
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10 



15 



20 



25 



<213> Bacillus thuringiensis 

<400> 33 



<4UU> -iJ 

gcttctacta caaatttaca attccataca tcaattgacg gaagacctat taatcagggg 60 
aatttttcag caactatgag tagtgggggt aatttacagt ccggaagctt taggactgca 120 
ggctttacta ctccgtttaa cttttcaaat ggatcaagta tatttacgtt aagtgctcat 180 
gtcttcaatt caggcaatga agtttatata gatcgaattg aatttgttcc ggcagaagta 240 



acattt 



<210> 34 
<211> 82 
<212> PRT 

<213> Bacillus thuringiensis 



<400> 34 ^, . „ 

Ala Ser Thr Thr Asn Leu Gin Phe His Thr Ser He Asp Gly Arg Pro 
1 5 10 15 

He Asn Gin Gly Asn Phe Ser Ala Thr Met Ser Ser Gly Gly Asn Leu 

25 30 



Gin Ser Gly Ser Phe Arg Thr Ala Gly Phe Thr Thr Pro Phe Asn Phe 
35 40 45 

Ser Asn Gly Ser Ser He Phe Thr Leu Ser Ala His Val Phe Asn Ser 
50 55 60 

Gly Asn Glu val Tyr He Asp Arg He Glu Phe Val Pro Ala Glu Val 
30 65 70 75 80 

Thr Phe 



35 



40 



45 



55 



<210> 35 
<211> 177 
<212> DNA 

<213> Bacillus thuringiensis 



ctctttccag attatattca gcctcgagtg ttgcagtaac tggaataaat tcaaatctgt 60 
ctattatcac tcctgcagtc ccactaaaat ttctaacacc tactatatta cctaatgaag 120 
atgtaaaagc attggcactt caaaatcact tgattgtaga ttatctaatg acgtagc 177 



<210> 36 
<211> 57 
<212> PRT 
50 <213> Bacillus thuringiensis 

Leu°ser^Arg Leu Tyr Ser Ala Ser Ser Val Ala Val Thr Gly He Asn 
1 5 10 15 



Ser Asn Leu Ser He He Thr Pro Ala Val Pro Leu Lys Phe Leu Thr 
20 25 30 
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Pro Thr lie Leu Pro Asn Glu Asp Val Lys Ala Leu Ala Leu Gin Asn 
35 40 45 

His Leu lie Val Asp Tyr Leu Met Thr 
50 55 



<210> 37 
10 <211> 4173 
<212> DNA 

<213> Bacillus thurlngiensis 

<220> 
IS <221> CDS 

<222> (1) . . (3687) 

<400> 37 

ttg act tea aat agg aaa aat gag aat gaa att ata aat get tta teg 48 

20 Met Thr Ser Asn Arg Lys Asn Glu Asn Glu lie lie Asn TQa Leu Ser 

15 10 15 

att cca get gta teg aat cat tec aea eaa atg gat eta tea cca gat 96 
He Pro Ala Val Ser Asn His Ser Thr Gin Met Asp Leu Ser Pro Asp 
25 20 25 30 

get cgt att gag gat tct ttg tgt ata gee gag ggg aat aat ate aat 144 
Ala Arg He Glu Asp Ser Leu Cys He Ala Glu Gly Asn Asn He Asn 
35 40 45 

30 

cca ctt gtt age gea tea aca gtc caa aeg ggt att aac ata get ggt 192 
Pro Leu Val Ser Ala Ser Thr val Gin Thr Gly. He Asn He Ala Gly 
50 55 60 

35 aga ata eta ggt gta tta ggc gta ccg ttt get gga caa ata get agt 240 
Arg He Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gin He Ala Ser 
65 70 75 80 

ttt tat agt ttt ctt gtt ggt ^aa tta tgg ccc ege ggc aga gat cag 288 
40 Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Gin 

85 90 95 

tgg gaa att tte eta gaa cat gtc gaa eaa ctt ata aat caa caa ata 336 
Trp Glu He Phe Leu Glu His Val Glu Gin Leu He Asn Gin Gin He 
45 100 105 110 

aca gaa aat get agg aat aeg gca ctt get cga tta caa ggt tta gga 384 
Thr Glu Asn Ala Arg Asn Thr Ala Leu Ala Arg Leu Gin Gly Leu Gly 
115 120 125 

SO 

gat tec ttt aga gee tat eaa cag tea ctt gaa gat tgg eta gaa aac 432 
Asp Ser Phe Arg Ala Tyr Gin Gin Ser Leu Glu Asp Trp Leu Glu Asn 
130 135 140 

S5 egt gat gat gca aga aeg aga agt gtt ctt tat ace caa tat ata gee 480 
Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gin Tyr He Ala 
145 150 155 160 
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45 



SO 



55 



tta gaa ctt gat ttt ctt aat gcg atg ccg ctt ttc gca att aga aac 
Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala He Arg Asn 
165 170 175 

caa gaa gtt cca tta tta atg gta tat get caa get gca aat tta cae 
Gin Glu Val Pro Leu Leu Met Val Tyr Ala Gin Ala Ala Asn Leu His 
180 185 190 



agt atg aat tgg tat aat aat aat gca cct teg ttt tec get ata gag 

Ser Met Asn Trp Tyr Asn Asn Asn Ala Pro Ser Phe Ser Ala He Glu 

40 305 310 315 320 

act gcg gtt ate cga age ecg cat eta ctt gat ttt eta gaa caa ctt 

Thr Ala Val He Arg Ser Pro His Leu Leu Asp Phe Leu Glu Gin Leu 

325 330 335 



528 



576 



10 eta tta tta ttg aga gat gcc tct ctt ttt ggt agt gaa ttt ggg ctt 624 

Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu 

195 200 205 

aca teg cag gaa att caa cgt tat tat gag cgc caa gtg gaa caa acg 672 

15 Thr Ser Gin Glu He Gin Arg Tyr Tyr Glu Arg Gin Val Glu Gin Thr 
210 215 220 I 



aga gat tat tec gac tat tgc gta gaa tgg tat aat aca ggt eta aat 
Arg Asp Tyr Ser Asp Tyr Cys Val Glu Trp Tyr Asn Thr Gly Leu Asn 
20 225 230 235 240 

age ttg aga ggg aca aat gcc gca agt tgg gtg cgt tat aat eaa ttc 

Ser Leu Arg Gly Thr Asn Ala Ala Ser Trp Val Arg Tyr Asn Gin Phe 

245 250 255 

25 

cgt aga gat eta acg tta ggg gta tta gat eta gtg gca eta ttc cca 

Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro 
260 265 270 

30 age tat gac act cgc act tat cca ata aat acg agt get cag tta aca 864 
Ser Tyr Asp Thr Arg Thr Tyr Pro He Asn Thr Ser Ala Gin Leu Thr 
275 280 285 



720 



768 



816 



agg gaa gtt tat aca gac gca att gga gca aca ggg gta aat atg gca 912 
35 Arg Glu Val Tyr Thr Asp Ala He Gly Ala Thr Gly Val Asn Met Ala 
290 295 300 



960 



1008 



aca att ttt age act tea tea cga tgg agt get act agg cat atg act 1056 

Thr He Phe Ser Thr Ser Ser Arg Trp Ser Ala Thr Arg His Met Thr 
340 345 350 

tac tgg egg ggg cac aca att caa tct egg cca ata gga ggc gga tta 1104 

Tyr Trp Arg Gly His Thr He Gin Ser Arg Pro He Gly Gly Gly Leu 

355 360 365 

aat ace tea acg eat ggg tct ace aat act tct att aat cct gta aga 1152 

Asn Thr Ser Thr His Gly Ser Thr Asn Thr Ser He Asn Pro Val Arg 

370 375 380 
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tta tea ttc ttc tct cga gac gta tat tgg act gaa tea tat gca gga 1200 
Leu Ser Phe Phe Ser Arg Asp Val Tyr Trp Thr Glu Ser Tyr Ala Gly 
385 390 395 400 

S gtg ctt eta tgg gga att tac ctt gaa cct att cat ggt gtc cct act 1248 
Val Leu Leu Trp Gly He Tyr Leu Glu Pro He His Gly Val Pro Thr 
405 410 415 

gtt aga ttt aat ttt agg aac cct cag aat act ttt gaa aga ggt act 1296 
10 Val Arg Phe Asn Phe Arg Asn Pro Gin Asn Thr Phe Glu Arg Gly Thr 
420 425 430 

get aac tat agt eaa cee tat gag tea eet ggg ett eaa tta aaa gat 1344 
Ala Asn Tyr Ser Gin Pro Tyr Glu Ser Pro Gly Leu Gin Leu Lys Asp 
IS 435 440 445 

tea gaa act gaa tta cea cea gaa aca aca gaa ega cca aat tat gaa 1392 

Ser Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu 

450 455 460 

20 

tea tat agt eat agg tta tct cae ata ggg etc att tea eaa tct agg 1440 

Ser Tyr Ser His Arg Leu Ser His He Gly Leu He Ser Gin Ser Arg 
465 470 475 480 

25 gtg cat gta cca gta tat tct tgg acg cac cgt agt gca gat cgt aca 1488 
Val His Val Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr 
485 490 495 

aat ace att agt tea gat age ata aca caa ata cca ttg gta aaa tea 1536 
30 Asn Thr He Ser Ser Asp Ser He Thr Gin He Pro Leu Val Lys Ser 
500 505 510 

ttc aac ctt aat tea ggt acc tct gta gtc agt ggc cca gga ttt aca 1584 
Phe Asn Leu Asn Ser Gly Thr Ser Val Val Ser Gly Pro Gly Phe Thr 
35 515 520 525 

gga ggg gat ata ate cga act aac gtt aat ggt agt gta eta agt atg 1632 
Gly Gly Asp He He Arg Thr Asn Val Asn Gly Ser Val Leu Ser Met 
530 535 540 

40 

ggt ctt aat ttt aat aat aca tea tta cag egg tat cgc gtg aga gtt 1680 
Gly Leu Asn Phe Asn Asn Thr Ser Leu Gin Arg Tyr Arg Val Arg Val 
545 550 555 560 

45 cgt tat get get tct caa aca atg gtc etg agg gta act gtc gga ggg 1728 
Arg Tyr Ala Ala Ser Gin Thr Met Val Leu Arg Val Thr Val Gly Gly 
565 570 575 

agt act act ttt gat caa gga ttc eet agt act atg agt gca aat gag 1776 
SO Ser Thr Thr Phe Asp Gin Gly Phe Pro Ser Thr Met Ser Ala Asn Glu 
580 585 590 

tct ttg aca tct eaa tea ttt aga ttt gca gaa ttt cct gta ggt att 1824 
Ser Leu Thr Ser Gin Ser Phe Arg Phe Ala Glu Phe Pro Val Gly He 
SS 595 600 60S 

agt gca tct ggc agt caa act get gga ata agt ata agt aat aat gca 1872 
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Ser Ala Ser Gly Ser Gin Thr Ala Gly He Ser He Ser Asn Asn Ala 
610 615 620 

ggt aga caa acg ttt cac ttt gat aaa att gaa ttc att cca att act 
Gly Arg Gin Thr Phe His Phe Asp Lys He Glu Phe lie Pro He Thr 
625 630 635 640 

gca acc ttc gaa gca gaa tac gat tta gaa agg gcg caa gag gcg gtg 
Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gin Glu Ala Val 



10 645 



650 655 



aat get ctg ttt act aat acg aat cca aga aga ttg aaa aca gat gtg 
Asn Ala Leu Phe Thr Asn Thr Asn Pro Arg Arg Leu Lys Thr Asp Val 



1920 



1968 



2016 



660 



665 670 



aca gat tat cat att gat caa gta tec aat tta gtg gcg tgt tta teg 2064 
Thr Asp Tyr His He Asp Gin Val Ser Asn Leu Val Ala Cys Leu Ser 



675 



680 685 



20 



gat gaa ttc tgc tta gat gaa aag aga gaa tta ett gag aaa gtg aaa 2X12 
Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Leu Glu Lys Val Lys 
690 695 700 



2208 



2256 



tat gcg aaa cga etc agt gat gaa aga aac tta etc caa gat cca aac 2160 
25 Tyr Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Glh Asp Pro Asn 
705 710 715 720 

ttc aca tec ate aat aag caa cca gac ttc ata tct act aat gag caa 
Phe Thr Ser He Asn Lys Gin Pro Asp Phe He Ser Thr Asn Glu Gin 
30 725 730 735 

teg aat ttc aea tct ate eat gaa caa tct gaa eat gga tgg tgg gga 
Ser Asn Phe Thr Ser He His Glu Gin Ser Glu His Gly Trp Trp Gly 
740 745 750 

agt gag aac att aca ate cag gaa gga aat gac gta ttt aaa gag aat 2304 ^ 
ser Glu Asn He Thr He Gin Glu Gly Asn Asp Val Phe Lys Glu Asn 
755 760 765 

40 tac gtc aca eta ccg ggg act ttt aat gag tgt tat ecg acg tat tta 
Tyr Val Thr Leu Pro Gly Thr Phe Asn Glu Cys Tyr Pro Thr Tyr Leu 
770 775 780 

tat caa aaa ata gga gag teg gaa tta aaa get tat act cgc tac caa 



45 



SO 



Tvr Gin Lys He Gly Glu Ser Glu Leu Lys Ala Tyr Thr Arg Tyr Gin 



785 790 



795 800 



tta aga ggg tat att gaa gat agt caa gat tta gag ata tat ttg att 
Leu Arg Gly Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He 
805 810 815 



2352 



2400 



2448 



2496 



cgt tat aat gcg aaa cat gaa aca ttg gat gtt cca ggt acc gag tec 
Arg Tyr Asn Ala Lys His Glu Thr Leu Asp Val Pro Gly Thr Glu Ser 
820 825 830 

" gta tgg ceg ett tea gtt gaa age cca ate gga agg tgc gga gaa ccg 2544 
Val Trp Pro Leu Ser Val Glu Ser Pro He Gly Arg Cys Gly Glu Pro 
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835 840 845 

aat cga tgc gca cca cat ttt gaa tgg aat cct gat eta gat tgt tec 
Asn Arg Cys Ala Pro His Phe Glu Trp Asn Pro Asp Leu Asp Cys Ser 
850 855 860 



aga aat gtc gtt aaa aat ggt gat ttt aat aat gga tta gca tgc tgg 
Arg Asn Val Val Lys Asn Gly Asp Phe Asn Asn Gly Leu Ala Cys Trp 
1025 1030 1035 1040 



2592 



tgc aga gat gga gaa aaa tgt gcg cat cat tec cat cat ttc tct ttg 2640 
Cys Arg TVsp Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu 
865 870 875 880 

gat att gat att gga tgc aca gac ttg cat gag aat eta gge gtg tgg 2688 
Asp lie Asp lie Gly Cys Thr Asp Leu His Glu Asn Leu Gly Val Trp 
885 890 895 

15 gtg gta ttc aag att aag acg cag gaa ggt cat gca aga eta ggg aat 2736 
Val Val Phe Lys lie Lys Thr Gin Glu Gly His Ala Arg Leu Gly Asn 
900 905 910 

ctg gaa ttt att gaa gag aaa cca tta tta gga gaa gca ctg tct cgt 2784 
20 Leu Glu Phe lie Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu Ser Arg 
915 920 925 

gtg aag aga gca gag aaa aaa tgg aga gac aaa cgt gaa aaa eta caa 2832 
Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Gin 
25 930 935 940 

ttg gaa aca aaa cga gta tat aca gag gca aaa gaa get gtg gat get 2880 
Leu Glu Thr Lys Arg Val Tyr Thr Glu Ala Lys Glu Ala Val Asp Ala 
945 950 955 960 

30 

tta ttt gta gat tct caa tat aat aga tta caa gcg gat aca aac att 2928 
Leu Phe Val Asp Ser Gin Tyr Asn Arg Leu Gin Ala Asp Thr Asn lie 
965 970 975 

35 ggc atg att cat gcg gca gat aaa ctt gtt cat cga att cga gag get 2976 
Gly Met lie His Ala Ala Asp Lys Leu Val His Arg lie Arg Glu Ala 
980 985 990 

tat ctg tea gaa tta tct gtt ate ccg ggt gta aat gcg gaa att ttt 3024 
40 Tyr Leu Ser Glu Leu Ser Val lie Pro Gly Val Asn Ala Glu lie Phe 
995 1000 1005 

gaa gaa tta gaa ggt cgc att ate act gca ate tec eta tae gat gcg 3072 
Glu Glu Leu Glu Gly Arg lie lie Thr Ala lie Ser Leu Tyr Asp Ala 
45 1010 1015 1020 



3120 



aat gta aaa ggg cat gta gat gta caa cag age cat eac cgt tct gtc 3168 
Asn Val Lys Gly His Val Asp Val Gin Gin Ser His His Arg Ser Val 
1045 1050 1055 

55 Ctt gtt ate cca gaa tgg gaa gca gaa gtg tea caa gca gtt ege gtc 3216 
Leu Val lie Pro Glu Trp Glu Ala Glu Val Ser Gin Ala Val Arg Val 
1060 1065 1070 
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tgt ccg ggg cgt ggc tat ate etc cgt gtc aca gcg tac aaa gag gga 3264 
Cys Pro Gly Arg Gly Tyr lie Leu Arg Val Thr Ala Tyr Lys Glu Gly 
1075 1080 1085 

5 

tat gga gag ggt tgt gta acg ate cat gaa ate gag aae aat aca gac 3312 
Tyr Gly Glu Gly Cys Val Thr lie His Glu lie Glu Asn Asn Thr Asp 
1090 1095 1100 

10 gaa eta aaa ttt aaa aac tgt gaa gaa gag gaa gtg tat cca acg gat 3360 
Glu Leu Lys Phe Lys Asn Cys Glu Glu Glu Glu Val Tyr Pro Thr Asp 
1105 1110 1115 1120 

aca gga aeg tgt aat gat tat act gca cac caa ggt aca gca gca tgt 3408 
15 Thr Gly Thr Cys Asn Asp Tyr Thr Ala His Gin Gly Thr Ala Ala Cys 
1125 1130 ' 1135 

aat tee cgt aat get gga tat gag gat gca tat gaa gtt gat act aca 3456 
Asn Ser Arg Asn Ala Gly Tyr Glu Asp Ala Tyr Glu Val Asp Thr Thr 
20 1140 1145 1150 



25 



gca tct gtt aat tac aaa ccg act tat gaa gaa gaa aeg tat aca gat 
Ala Ser Val Asn Tyr Lys Pro Thr Tyr Glu Glu Glu Thr Tyr Thr Asp 
1155 1160 1165 



3504 



gta cga aga gat aat cat tgt gaa tat gac aga ggg tat gtg aat tat 3552 
Val Arg Arg Asp Asn His Cys Glu Tyr Asp Arg Gly Tyr Val Asn Tyr 
1170 1175 1180 

30 cca cca eta cca get ggt tat gtg aca aag gaa tta gaa tat ttc eca 3600 
Pro Pro Leu Pro Ala Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro 
1185 1190 1195 1200 

gaa ace gat aag gta tgg att gag att gga gaa acg gaa gga aca ttc 3648 
35 Glu Thr Asp Lys Val Trp lie Glu lie Gly Glu Thr Glu Gly Thr Phe 



40 




1220 


1225 










gtatagcagt 


ttaataaatc 


ttaateaaaa 


tagtagtcta 


actteegtta 


caatttaata 


3757 


45 


agtaaattac 


agttgtaaaa 


agaaaacgga 


catcactect 


aagagagega 


tgtccgtttt 


3817 


ctatatggtg 


tgtgctaaeg 


ataagtgtac 


aeggaatttc 


attatceaaa 


ttaatattta 


3877 




tttgagaaaa 


ggateatgtt 


atatagagat 


atttccttat 


aatatttgtt 


ceaegttcat 


3937 


SO 


aatttttgaa 


tgatacatta 


caacaaaaae 


tgtcacaaat 


ttatatgttc 


tacataaaat 


3997 




atatggttaa 


gaacetaaga 


agttatgaat 


caagtaatag 


gataaaactg 


aaaaaggaag 


4057 


55 


tgtaggtaca 


atgaataaaa 


aaataagaaa 


tgaagatgag 


cattcatega 


tagaattatc 


4117 


atatagtact 


tcaaaaaate 


aaaagcataa 


ggtaceattt 


tgttgtacaa 


tttcag 


4173 



1205 1210 1215 S 

ate gtg gac age ata gaa tta etc ctt atg gaa gaa tag gacegtccga 3697 
lie Val Asp Ser lie Glu Leu Leu Leu Met Glu Glu 
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<210> 38 
<211> 1228 
<212> PRT 
S <213> Bacillus thuringiensis 

<:400> 38 

Met Thr Ser Asn Arg Lys Asn Glu Asn Glu lie He Asn Ala Leu Ser 
15 10 15 

10 He Pro Ala Val Ser Asn His Ser Thr Gin Met Asp Leu Ser Pro Asp 
20 25 30 

Ala Arg He Glu Asp Ser Leu Cys He Ala Glu Gly Asn Asn He Asn 

35 40 45 

Pro Leu Val Ser Ala Ser Thr Val Gin Thr Gly He Asn He Ala Gly 
15 50 55 60 

Arg He Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gin He Ala Ser 
65 70 75 80 

Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Gin 
85 90 95 

20 Trp Glu He Phe Leu Glu His Val Glu Gin Leu He Asn Gin Gin He 
100 105 110 

Thr Glu Asn Ala Arg Asn Thr Ala Leu Ala Arg Leu Gin Gly Leu Gly 

115 120 125 

Asp Ser Phe Arg Ala Tyr Gin Gin Ser Leu Glu Asp Trp Leu Glu Asn 
25 130 135 140 

Arg Asp Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gin Tyr He Ala 
145 150 155 160 

Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala He Arg Asn 
165 170 175 

30 Gin Glu Val Pro Leu Leu Met Val Tyr Ala Gin Ala Ala Asn Leu His 
180 185 190 

Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu 

195 200 205 

Thr Ser Gin Glu He Gin Arg Tyr Tyr Glu Arg Gin Val Glu Gin Thr 
35 210 215 220 

Arg Asp Tyr Ser Asp Tyr Cys Val Glu Trp Tyr Asn Thr Gly Leu Asn 
225 230 235 240 

Ser Leu Arg Gly Thr Asn Ala Ala Ser Trp Val Arg Tyr Asn Gin Phe 
245 250 255 

40 Arg Arg Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro 
260 265 270 

Ser Tyr Asp Thr Arg Thr Tyr Pro He Asn Thr Ser Ala Gin Leu Thr 

275 280 285 

Arg Glu Val Tyr Thr Asp Ala He Gly Ala Thr Gly Val Asn Met Ala 
45 290 295 300 

Ser Met Asn Trp Tyr Asn Asn Asn Ala Pro Ser Phe Ser Ala He Glu 
305 310 315 320 

Thr Ala Val He Arg Ser Pro His Leu Leu Asp Phe Leu Glu Gin Leu 
325 330 335 

50 Thr He Phe Ser Thr Ser Ser Arg Trp Ser Ala Thr Arg His Met Thr 
340 345 350 

Tyr Trp Arg Gly His Thr He Gin Ser Arg Pro He Gly Gly Gly Leu 

355 360 365 

Asn Thr Ser Thr His Gly Ser Thr Asn Thr Ser He Asn Pro Val Arg 
55 370 375 380 

Leu Ser Phe Phe Ser Arg Asp Val " Tyr Trp Thr Glu Ser Tyr Ala Gly 
385 390 395 400 
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Val Leu Leu Trp Gly He Tyr Leu Glu Pro He His Gly Val Pro Thr 

405 410 415 

Val Arg Phe Asn Phe Arg Asn Pro Gin Asn Thr Phe Glu Arg Gly Thr 
420 425 430 

5 Ala Asn Tyr Ser Gin Pro Tyr Glu Ser Pro Gly Leu Gin Leu Lys Asp 
435 440 445 

Ser Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu 

450 455 460 

Ser Tyr Ser His Arg Leu Ser His He Gly Leu He Ser <31n Ser Arg 
10 465 470 475 480 

Val His Val Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr 

485 490 495 

Asn Thr He Ser Ser Asp Ser He Thr Gin He Pro Leu Val Lys Ser 
500 505 510 

15 Phe Asn Leu Asn Ser Gly Thr Ser Val Val Ser Gly Pro Gly Phe Thr 
515 520 525 

Gly Gly Asp He He Arg Thr Asn Val Asn Gly Ser Val Leu Ser Met 

530 535 540 

Gly Leu Asn Phe Asn Asn Thr Ser Leu Gin Arg Tyr Arg Val Arg Val 
20 545 550 555 560 

Arg Tyr Ala Ala Ser Gin Thr Met Val Leu Arg Val Thr Val Gly Gly 

565 570 575 

Ser Thr Thr Phe Asp Gin Gly Phe Pro Ser Thr Met Ser Ala Asn Glu 
580 585 590 

25 Ser Leu Thr Ser Gin Ser Phe Arg Phe Ala Glu Phe Pro Val Gly He 
595 600 605 

Ser Ala Ser Gly Ser Gin Thr Ala Gly He Ser He Ser Asn Asn Ala 

610 615 620 

Gly Arg Gin Thr Phe His Phe Asp Lys He Glu Phe He Pro He Thr 
30 625 G30 635 640 

Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gin Glu Ala Val 

645 650 655 

Asn Ala Leu Phe Thr Asn Thr Asn Pro Arg Arg Leu Lys Thr Asp Val 
660 665 670 

35 Thr Asp Tyr His He Asp Gin Val Ser Asn Leu Val Ala Cys Leu Ser 
675 680 685 

Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Leu Glu Lys Val Lys 

690 695 700 

Tyr Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Pro Asn 
40 705 710 715 720 

Phe Thr Ser He Asn Lys Gin Pro Asp Phe He Ser Thr Asn Glu Gin 

725 730 735 

Ser Asn Phe Thr Ser He His Glu Gin Ser Glu His Gly Trp Trp Gly 
740 745 750 

45 Ser Glu Asn He Thr He Gin Glu Gly Asn Asp Val Phe Lys Glu Asn 
755 760 765 

Tyr Val Thr Leu Pro Gly Thr Phe Asn Glu Cys Tyr Pro Thr Tyr Leu 

770 775 780 

Tyr Gin Lys He Gly Glu Ser Glu Leu Lys Ala Tyr Thr Arg Tyr Gin 
50 7 8 5 7 9 0 7 9 5 8 0 0 

Leu Arg Gly Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He 

805 810 815 

Arg Tyr Asn Ala Lys His Glu Thr Leu Asp Val Pro Gly Thr Glu Ser 
820 825 830 

55 Val Trp Pro Leu Ser val Glu Ser Pro He Gly Arg Cys Gly Glu Pro 
835 840 845 

Asn Arg Cys Ala Pro His Phe Glu Trp Asn Pro Asp Leu Asp Cys Ser 
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850 855 860 

Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu 
865 870 875 880 

Asp lie Asp lie Gly Cys Thr Asp Leu His Glu Asn Leu Gly Val Trp 
S 885 890 895 

Val Val Phe Lys lie Lys Thr Gin Glu Gly His Ala Arg Leu Gly Asn 

900 905 910 

Leu Glu Phe lie Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu Ser Arg 
915 920 925 

10 Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Gin 
930 935 940 

Leu Glu Thr Lys Arg Val Tyr Thr Glu Ala Lys Glu Ala Val Asp Ala 
945 950 955 960 

Leu Phe Val Asp Ser Gin Tyr Asn Arg Leu Gin Ala Asp Thr Asn lie 
15 965 970 975 

Gly Met lie His Ala Ala Asp Lys Leu Val His Arg He Arg Glu Ala 

980 985 99Q 

Tyr Leu Ser Glu Leu Ser Val He Pro Gly Val Asn Ala Glu He Phe 
995 1000 1005 

20 Glu Glu Leu Glu Gly Arg lie He Thr Ala He Ser Leu Tyr Asp Ala 
1010 1015 1020 

Arg Asn Val Val Lys Asn Gly Asp Phe Asn Asn Gly Leu Ala Cys Trp 
1025 1030 1035 1040 

Asn Val Lys Gly His Val Asp Val Gin Gin Ser His His Arg Ser Val 
25 1045 1050 1055 

Leu Val He Pro Glu Trp Glu Ala Glu Val Ser Gin Ala Val Arg Val 

1060 1065 1070 

Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr Lys Glu Gly 
1075 1080 1085 

30 Tyr Gly Glu Gly Cys Val Thr He His Glu He Glu Asn Asn Thr Asp 
1090 1095 1100 

Glu Leu Lys Phe Lys Asn Cys Glu Glu Glu Glu Val Tyr Pro Thr Asp 
1105 1110 1115 1120 

Thr Gly Thr Cys Asn Asp Tyr Thr Ala His Gin Gly Thr Ala Ala Cys 
35 1125 1130 1135 

Asn Ser Arg Asn Ala Gly Tyr Glu Asp Ala Tyr Glu Val Asp Thr Thr 

1140 1145 1150 

Ala Ser Val Asn Tyr Lys Pro Thr Tyr Glu Glu Glu Thr Tyr Thr Asp 
1155 1160 1165 

40 Val Arg Arg Asp Asn His Cys Glu Tyr Asp Arg Gly Tyr Val Asn Tyr 
1170 1175 1180 

Pro Pro Leu Pro Ala Gly Tyr Val Thr Lys Glu Leu Glu Tyr Phe Pro 
1185 1190 1195 1200 

Glu Thr Asp Lys Val Trp He Glu He Gly Glu Thr Glu Gly Thr Phe 
45 1205 1210 1215 

He Val Asp Ser He Glu Leu Leu Leu Met Glu Glu 
1220 1225 



<210> 39 
<211> 3504 
<212> DNA 

<213> Bacillus thuringiensis 

<220> 
<221> CDS 
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I 

<222> (1)..(3504) 

ItTlsil^aga aat aat cag gat caa tgc att cct tat aat tgt tta aat 48 
Met Glu Arg Asn Asn Gin Asp Gin Cys lie Pro Tyr Asn Cys Leu Asn 



1 



5 10 15 



50 



X30 



135 140 



96 



144 



aat cct gag att gag ata tta gat gtt gaa aat ttc aat etc gaa ctt 
Asn pro Glu lie Glu He Leu Asp Val Glu Asn Phe Asn Leu Glu Leu 
,0 20 25 30 

gta teg caa gtc agt gtg gga ctt aca cgt ttt ctt eta gag tea get 
val Ser Gin Val Ser Val Gly Leu Thr Arg Phe Leu Leu Glu Ser Ala 
35 40 4S 

gtc cea gga gcg ggt ttt gca ctt ggc eta ttc gat ate att tgg gga 192 
val Pro Gly Ala Gly Phe Ala Leu Gly Leu Phe Asp He He Trp Gly 



55 60 

■aa aac tta ttc Ctt gcg caa att gag caa 
20 gee eta ggi; yuv- n^- — — -93 a9C tta twu v.«, a a = 

Ala Leu Gly Val Asp Gin Trp Ser Leu Phe Leu Ala Gin He Glu Gin 
65 70 75 80 

tta att aat gaa agg ata aca aca gtt gaa agg aat aga gca att caa 
25 Leu He Asn Glu Arg He Thr Thr Val Glu Arg Asn Arg Ala lie Gin 

85 90 95 

aca tta agt gga eta teg agt agt tat gaa gta tat att gag gca tta 
Thr llu llr Gly Leu Ser Ser Ser Tyr Glu Val Tyr He Glu Ala Leu 
30 100 105 110 

aga gaa tgg gag aat aat cea gat aat eca get tea caa gag aga gtg 
Sg llu Trp Glu Asn Asn Pro Asp Asn Pro Ala Ser Gin Glu Arg Val 
115 120 125 

cgt aca cga ttt cgt aca acg gac gae get eta ata aca get ata cct 432 
Arg Thr Arg Phe Arg Thr Thr Asp Asp Ala Leu He Thr Ala He Pro 



240 



288 



336 



384 



480 



40 aat tta gcg att cea gat ttt gag ata get act tta tea gtg tat gtt 
Asn Leu Ala He Pro Asp Phe Glu He Ala Thr Leu Ser Val Tyr Val 
145 150 155 160 

caa gca gee aat eta cat eta tct tta tta aga gat get gtt tac ttt 
45 Gin Ala Ala Asn Leu His Leu Ser Leu Leu Arg Asp Ala Val Tyr Phe 
165 170 175 

gga gaa aga tgg gga etc aca caa gta aat att gaa gat ctt tat acg 
Gly Glu Arg Trp Gly Leu Thr Gin Val Asn He Glu Asp Leu Tyr Thr 
50 180 185 190 

aga tta aca aga aat att cat att tat tea gat cat tgt gca agg tgg 
AM Leu Thr Arg Asn He His He Tyr Ser Asp His Cys Ala Arg Trp 
195 200 205 

" tat aat caa ggt tta aat aat att gga gca aca aat acg aga tat ttg 672 
Tyr Asn Gin Gly Leu Asn Asn He Gly Ala Thr Asn Thr Arg Tyr Leu 



528 



576 



624 
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210 215 220 

gaa ttc caa aga gaa tta aca etc tct gtc tta gat att gtg gcc ctt 
Glu Phe Gin Arg Glu Leu Thr Leu Ser Val Leu Asp He Val Ala Leu 
225 230 235 240 

ttc ccg aat tac gac ate cga aca tat tea att ccg aca caa agt caa 
Phe Pro Asn Tyr Asp He Arg Thr Tyr Ser He Pro Thr Gin Ser Gin 
245 250 255 

tta aca agg gag att tat acc gat ata att get gca ccc aat gea tea 
Leu Thr Arg Glu lie Tyr Thr Asp He He Ala Ala Pro Asn Ala Ser 
260 265 270 

aat tta ata gtg gga acg caa ggc eta gtg aga gca cct cac tta atg 
Asn Leu He Val Gly Thr Gin Gly Leu Val Arg Ala Pro His Leu Met 
275 280 285 

gac ttt tta gtc cgt ttg aat att tat act ggc ttg get aga aat att 
Asp Phe Leu Val Arg Leu Asn He Tyr Thr Gly Leu Ala Arg Asn He 
290 295 300 

cgt cat tgg gca gga cat gaa gta ata tct aga aga aca ggt gga gtg 
Arg His Trp Ala Gly His Glu Val He Ser Arg Arg Thr Gly Gly Val 
25 305 310 315 320 

gat tta aat act ata caa tct cct tta tat gga aca get gea act aca 

Asp Leu Asn Thr He Gin Ser Pro Leu Tyr Gly Thr Ala Ala Thr Thr 

325 330 335 

30 

gaa agt cca cgt tta ata att cct ttt aat gag gat tct tat ctt ggt 

Glu Ser Pro Arg Leu He He Pro Phe Asn Glu Asp Ser Tyr Leu Gly 

340 345 350 

35 ggt ttt att tat aga aca tta tea tec cct att tat gta cca cct tct 
Gly Phe He Tyr Arg Thr Leu Ser Ser Pro He Tyr Val Pro Pro Ser 
355 360 365 



10 



IS 



20 



50 



55 



ggt get agt cat aga ttg tgt cat get aca ttt get caa tea cct ata 
Gly Ala Ser His Arg Leu Cys His Ala Thr Phe Ala Gin Ser Pro He 
420 425 430 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



gga att teg agt caa aga aca tet tta gtg gag ggt gtg gga ttt cag 1152 
40 Gly He Ser Ser Gin Arg Thr Ser Leu Val Glu Gly Val Gly Phe Gin 
370 375 380 

aca ccg aat aac tea ata ctt caa tac aga caa cgt gga aca ttg gat 
Thr Pro Asn Asn Ser He Leu Gin Tyr Arg Gin Arg Gly Thr Leu Asp 
45 385 390 395 400 



1200 



tec ctt gag caa gta cea ctt caa gaa gag ggg aga cca gge ggt ttt 1248 
ser Leu Glu Gin Val Pro Leu Gin Glu Glu Gly Arg Pro Gly Gly Phe 
405 410 415 



1296 



ggt act aae tat tat ata agg gca ccg ttg ttt tet tgg aeg cat ctg 1344 
Gly Thr Asn Tyr Tyr He Arg Ala Pro Leu Phe Ser Trp Thr His Leu 
435 440 445 
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agt gca act ctt act aat gaa gtt cgt gta tct aga att aca caa tta 
Ser Ala Thr Leu Thr Asn Glu Val Arg Val Ser Arg lie Thr Gin Leu 
450 455 460 

5 

ccg atg gtg aag gcg cat acg ctt cat gcg gga get act gtt gtt aga 
Pro Met Val Lys Ala His Thr Leu His Ala Gly Ala Thr Val Val Arg 
465 470 475 480 



1392 



1440 



10 gga cca gga ttt aca gga gga gat ata etc cga aga act act tea ggc 1488 

Gly Pro Gly Phe Thr Gly Gly Asp lie Leu Arg Arg Thr Thr Ser Gly 

485 490 495 

tea ttt ggg gat atg aga ata aca aat ttt tea agt tea tea teg agg 1536 

15 Ser Phe Gly Asp Met Arg lie Thr Asn Phe Ser Ser Ser Ser Ser Arg 

500 505 510 

tat cgt gta aga ata cgt tat get tct act aca gat tta caa ttt ttc 1584 

Tyr Arg Val Arg lie Arg Tyr Ala Ser Thr Thr Asp Leu Gin Phe Phe 
20 515 520 525 

ttg aat gtt gga gga acc cct gtc aat gta gcc gat ttc ccg aaa acc 1632 

Leu Asn Val Gly Gly Thr Pro Val Asn Val Ala Asp Phe Pro Lys Thr 

530 535 540 

25 

ata gat aga ggg gaa aac tta gaa tat gga age ttt aga acg gca ggt 

lie Asp Arg Gly Glu Asn Leu Glu Tyr Gly Ser Phe Arg Thr Ala Gly 
545 550 555 560 



1680 



30 ttt act acc cct ttt agt ttt gta agt tea aca aat aat ttc aca tta 
Phe Thr Thr Pro Phe Ser Phe Val Ser Ser Thr Asn TVsn Phe Thr Leu 
565 570 575 



1728 



1776 



ggt gtt cag agt gtt tct tea ggt aac gag att ttt gta gat cga att 

35 Gly Val Gin Ser Val Ser Ser Gly Asn Glu He Phe Val Asp Arg He 

580 585 590 ^ 

gaa ttt gtt ccg gca gat gca acc ttt gag gca gaa tat gat tta gaa 1824 

Glu Phe Val Pro Ala Asp Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu 
40 595 600 605 

aga gcg caa gag gcg gtg aat get ctg ttt act tct acg aat caa aga 1872 

Arg Ala Gin Glu Ala Val Asn Ala Leu Phe Thr Ser Thr Asn Gin Arg 
610 615 620 

45 

gga ctg aaa aca gat gtg acg gat tat eat att gat caa gtg tec aat 

Gly Leu Lys Thr Asp Val Thr Asp Tyr His He Asp Gin Val Ser Asn 

625 630 635 640 

SO tta gtg gat tgt tta tec gat gaa ttc tgt eta gat gaa aaa aga gaa 1968 

Leu Val Asp Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu 

645 650 655 



1920 



ttg tec gaa aaa att aaa cat gca aag cga etc agt gat gag egg aat 
55 Leu Ser Glu Lys He Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn 
660 665 670 



2016 
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tta etc caa gat tea aac ttt aga ggc ate aat aga caa cea gat egt 2064 
Leu Leu Gin Asp Ser Asn Phe Arg Gly lie Asn Arg Gin Pro Asp Arg 
675 680 685 

5 ggc tgg aga gga agt acg gat att act ate caa gga gga aat gac gta 2112 
Gly Trp Arg Gly Ser Thr Asp He Thr He Gin Gly Gly Asn Asp Val 
690 695 700 

ttc aaa gaa aat tac gtc aca eta cca ggt acc ttt gat gag tgc tat 2160 
10 Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr 
705 710 715 720 

cca aca tat ttg tat caa aaa ate gat gaa tea aaa tta aaa gee ttt 2208 
Pro Thr Tyr Leu Tyr Gin Lys He Asp Glu Ser Lys Leu Lys Ala Phe 
15 725 730 735 

acc cgt tat caa tta aga ggg tat ate gaa gat agt caa gac tta gaa 2256 

Thr Arg Tyr Gin Leu Arg Gly Tyr He Glu Asp Ser Gin Asp Leu Glu 

740 745 750 

20 

ate tat tta att cgc tac aat gca aaa cat gaa aca gta aat gtg cca 2304 

He Tyr Leu He Arg Tyr Asn Ala Lys His Glu Thr Val Asn Val Pro 
755 760 765 

25 ggt acg ggt tec tta tgg ccg ctt tea gee caa agt cea ate gga aag 2352 
Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala Gin Ser Pro He Gly Lys 
770 775 780 

tgt gga gag ccg aat cga tgc gcg cca cac ctt gaa tgg aat cct gac 2400 
30 Cys Gly Glu Pro Asn Arg Cys Ala Pro His Leu Glu Trp Asn Pro Asp 
785 790 795 800 

tta gat tgt teg tgt agg gat gga gaa aag tgt gcc cat cat teg eat 2448 
Leu Asp Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His 
35 805 810 815 

eat ttc tec tta gae att gat gta gga tgt aca gac tta aat gag gac 2496 
His Phe Ser Leu Asp He Asp Val Gly Cys Thr Asp Leu Asn Glu Asp 
620 825 830 

40 

eta ggt gta tgg gtg ate ttt aag att aag acg caa gat ggg cac gca 2544 
Leu Gly Val Trp Val He Phe Lys He Lys Thr Gin Asp Gly His Ala 
835 840 845 

45 aga eta ggg aat eta gag ttt etc gaa gag aaa cca tta gta gga gaa 2592 
Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Val Gly Glu 
850 855 860 

geg eta get cgt gtg aaa aga geg gag aaa aaa tgg aga gae aaa cgt 2640 
50 Ala Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg 
865 870 875 880 

gaa aaa ttg gaa tgg gaa aca aat ate gtt tat aaa gag gca aaa gaa 2688 
Glu Lys Leu Glu Trp Glu Thr Asn He Val Tyr Lys Glu Ala Lys Glu 
55 885 890 895 

tct gta gat get tta ttt gta aac tct caa tat gat caa tta caa gcg 2736 
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Ser Val Asp Ala Leu Phe Val Asn Ser Gin Tyr Asp Gin Leu Gin Ala 



900 



905 910 



qat acq aat att gcc atg att cat gcg gca gat aaa cgt gtt cat age 
ASP Thr Asn He Ala Met He His Ala Ala Asp Lys Arg Val Hxs Ser 

920 925 



915 



att cga gaa get tat ctg cct gag ctg tct gtg att ccg ggt gtc aat 
ill Arg llu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly Val Asn 

935 940 



10 930 



2784 



2832 



2880 



gcg get att ttt gaa gaa tta gaa ggg cgt att ttc act gca ttc tec 
La Ala lie Phe Glu Glu Leu Glu Gly Arg He Phe Thr Ala Phe Ser 
945 950 955 960 

eta tat gat gcg aga aat gtc att aaa aat ggt gat ttt aa^ aat gge 2928 



Z: Ala Z; lll Vll Lys Asn oly Asp Phe Asn Asn Gly 

970 975 



965 



20 tta tec tgc tgg aac gtg aaa ggg cat gta gat gta gaa gaa caa aac 
Leu ser Cys T^ Asn Val Lys Gly His Val Asp Val Glu Glu Gin Asn 
980 985 990 



aae caa cgt teg gtc ctt gtt gtt ccg gaa tgg gaa gca gaa gtg tea 
25 Asn Gin Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val Ser 



995 



1000 1005 



caa gaa gtt cgt gtc tgt ccg ggt cgt ggc tat ate ctt cgt gtc aca 
Gin Glu val Arg Val Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr 

1015 1020 



2976 



3024 



3072 



30 1010 



gcg tac aag gag gga tat gga gaa ggt tgc gta ace att eat gag ate 3120 

Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr He Hxs <^^^ ^ 
1025 1030 

" gag aae aat aca gae gaa ctg aag ttt age aae tge gta gaa gag gaa 3168 

Glu Asn Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu Glu 

Tcmn 1055 



1045 1050 

40 ate tat cea aat aac acg gta aeg tgt aat gat tat act gta aat caa 
He Tyr Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Thr Val Asn Gin 
1060 1065 1070 



aaa aaa tae qga ggt gcg tac act tet cgt aat cga gga tat aac gaa 
45 Su ?S Tr Sy llv 111 Tyr Thr Ser Arg Asn Arg Gly Tyr Asn Glu 

1080 1085 



1075 



3216 



3264 



3312 



3360 



get cet tec gta cea get gat tat gcg tea gtc tat gaa gaa aaa teg 
lla Pro ser Val Pro Ala Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser 
50 1090 1095 1100 

tat aca gat gga cga aga gag aat cct tgt gaa ttt aac aga ggg tat 
^ S ASP lly A?g Arg Glu Asn Pro Cys Glu Phe Asn Arg Gly Tyr 
ll05 1110 

" agg gat tac aeg cea eta cea gtt ggt tat gtg aca aaa gaa tta gaa 3408 
SI Asp Tyr Thr Pro Leu Pro Val Gly Tyr Val Thr Lys Glu Leu Glu 
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1125 1130 1135 

tac ttc cca gaa acc gat aag gta tgg att gag att gga gaa acg gaa 3456 
Tyr Phe Pro Glu Thr Asp Lys Val Trp lie Glu He Gly Glu Thr Glu 
1140 1145 1150 

gga aca ttt ate gtg gac age gtg gaa tta etc ctt atg gag gaa tag 3504 
Gly Thr Phe He Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
1155 1160 1165 



<210> 40 
<211> 1167 
<212> PRT 
IS <213> Baeillus thuringiensis 

<400> 40 

Met Glu Arg Asn Asn Gin Asp Gin Cys He Pro Tyr Asn Cys Leu Asn 
15 10 15 

20 Asn Pro Glu He Glu He Leu Asp Val Glu Asn Phe Asn Leu Glu Leu 
20 25 30 

Val Ser Gin Val Ser Val Gly Leu Thr Arg Phe Leu Leu Glu Ser Ala 

35 40 45 

Val Pro Gly Ala Gly Phe Ala Leu Gly Leu Phe Asp He He Trp Gly 
25 50 55 60 

Ala Leu Gly Val Asp Gin Trp Sey Leu Phe Leu Ala Gin He Glu Gin 
65 70 75 80 

Leu He Asn Glu Arg He Thr Thr Val Glu Arg Asn Arg Ala He Gin 
85 90 95 

30 Thr Leu Ser Gly Leu Ser Ser Ser Tyr Glu Val Tyr He Glu Ala Leu 
100 105 110 

Arg Glu Trp Glu Asn Asn Pro Asp Asn Pro Ala Ser Gin Glu Arg Val 

115 120 125 

Arg Thr Arg Phe Arg Thr Thr Asp Asp Ala Leu He Thr Ala He Pro 
35 130 135 140 

Asn Leu Ala He Pro Asp Phe Glu He Ala Thr Leu Ser Val Tyr Val 
145 150 155 igo 

Gin Ala Ala Asn Leu His Leu Ser Leu Leu Arg Asp Ala Val Tyr Phe 
165 170 175 

40 Gly Glu Arg Trp Gly Leu Thr Gin Val Asn He Glu Asp Leu Tyr Thr 
180 185 190 

Arg Leu Thr Arg Asn He His He Tyr Ser Asp His Cys Ala Arg Trp 

195 200 205 

Tyr Asn Gin Gly Leu Asn Asn He Gly Ala Thr Asn Thr Arg Tyr Leu 
45 210 215 220 

Glu Phe Gin Arg Glu Leu Thr Leu Ser Val Leu Asp He Val Ala Leu 
225 230 235 240 

Phe Pro Asn Tyr Asp He Arg Thr Tyr Ser He Pro Thr Gin Ser Gin 
245 250 255 

50 Leu Thr Arg Glu He Tyr Thr Asp He He Ala Ala Pro Asn Ala Ser 
260 265 270 

Asn Leu He Val Gly Thr Gin Gly Leu Val Arg Ala Pro His Leu Met 

275 280 285 

Asp Phe Leu Val Arg Leu Asn He Tyr Thr Gly Leu Ala Arg Asn He 
55 290 295 300 

Arg His Trp Ala Gly His Glu Val He Ser Arg Arg Thr Gly Gly Val 
305 310 315 320 
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ASP Leu Asn Thr lie Gin Ser Pro Leu Tyr Gly Thr Ala Ala Thr Thr 
Glu ser Pro Arg Leu He He Pro Phe Asn Glu Asp Ser Tyr Leu Gly 
Gly Phe lie ^ Arg Thr Leu Ser Ser Pro He Tyr Val Pro Pro Ser 
Gly lie ser Ser Gin Arg Thr Ser Leu Val Glu Gly Val Gly Phe Gin 



375 



Thr pro Asn Asn Ser He Leu Gin Tyr Arg Gin Arg Gly Thr Leu Asp 



10 385 





ser Leu Glu Gin Val Pro Leu Gin Glu Glu Gly Arg Pro Gly Gly Phe 



Gly Ala ser His i^g Leu Cys His Ala ^h^ Phe Ala Gin Ser Pro He 

420 425 

,5 Gly Thr Asn Tyr Tyr He Arg Ala Pro Leu Phe Ser Trp Thr Hxs Leu 
ser Ala ihr Leu Thr Asn Glu Tal Arg Val Ser Arg He Thir Gin Leu 
Pro 21 val Lys Ala His ^hr Leu His Ala Gly ^a Thr Val Val Arg 



410 



20 465 



Z pro Gly Phe Thr lly Gly Asp He Leu ^g Arg Thr Thr Ser Gly 



475 



ser Phe Gly Asp iet Arg He Thr Asn Phe Ser Ser Ser Ser Ser Arg 

SOO 505' 
Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr Asp Leu Gin Phe Phe 

Leu Asn "l Gly Gly Thr Pro vll Asn Val Ala Asp Phe Pro Lys Thr 

lie isp Arg Gly Glu Asn Leu Glu Tyr Gly Ser Phe Arg Thr Ala Gly 



490 



'° III Thr Thr pro Phe Ser Phe Val Ser Ser Thr Asn Asn Phe Thr Leu 
Gly val Gin Ser Val ser Ser Gly Asn IZ He Phe Val Asp Arg He 



585 



35 Glu Phe val Pro Ala Asp Ala Thr Phe Glu Ala Glu Tyr Asp Leu Glu 

Arg Ala G^n Glu Ala Val Asn HI Leu Phe Thr Ser Thr Asn Gin Arg 

Oly III Lys Thr Asp Val Sr Asp Tyr His He Asp Gin Val Ser Asn 

Leu val ASP Cys Leu III Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu 

645 ® 
Leu ser Glu Lys He Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn 

45 Leu Leu Gin fsp Ser Asn Phe Arg Gly He Asn Arg Gin Pro Asp Arg 

Gly Trp irg Gly Ser Thr Asp He Thr He Gin Gly Gly Asn Asp Val 

695 700 
Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp Glu Cys Tyr 

710 7X5 
Pro Thr TVr Leu Tyr Gin Lys He Asp Glu Ser Lys Leu Lys Ala Phe 

Thr Arg Tyr Gin Leu Arg Gly Tyr He Glu Asp Ser Gin Asp Leu Glu 
740 ■'^S " = 0 

55 He Tyr Leu He Arg Tyr Asn Ala Lys His Glu Thr Val Asn Val Pro 

Gly Thr Gly Ser Leu Trp Pro LeS Ser Ala Gin Ser Pro He Gly Lys 
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770 775 780 

Cys Gly Glu Pro Asn Arg Cys Ala Pro His Leu Glu Trp Asn Pro Asp 
785 790 795 800 

Leu Asp Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His 
5 805 810 815 

His Phe Ser Leu Asp He Asp Val Gly Cys Thr Asp Leu Asn Glu Asp 

820 825 830 

Leu Gly Val Trp Val He Phe Lys He Lys Thr Gin Asp Gly His Ala 
835 840 845 

10 Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Val Gly Glu 
850 855 860 

Ala Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg 
865 870 875 880 

Glu Lys Leu Glu Trp Glu Thr Asn He Val Tyr Lys Glu Ala Lys Glu 
IS 885 690 895 

Ser Val Asp Ala Leu Phe Val Asn Ser Gin Tyr Asp Gin Leu Gin Ala 

900 905 910 

Asp Thr Asn He Ala Met He His Ala Ala Asp Lys Arg Val His Ser 
915 920 925 

20 He Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly Val Asn 
930 935 940 

Ala Ala He Phe Glu Glu Leu Glu Gly Arg He Phe Thr Ala Phe Ser 
945 950 955 960 

Leu Tyr Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe Asn Asn Gly 
25 965 970 975 

Leu Ser Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu Gin Asn 

980 985 990 

Asn Gin Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala Glu Val Ser 
995 1000 1005 

30 Gin Glu Val Arg Val Cys Pro Gly Arg Gly Tyr He Leu Arg Val Thr 
1010 1015 1020 

Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr He His Glu He 
1025 1030 1035 1040 

Glu Asn Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu Glu Glu 
35 1045 1050 1055 

He Tyr Pro Asn Asn Thr Val Thr Cys Asn Asp Tyr Thr Val Asn Gin 

1060 1065 1070 

Glu Glu Tyr Gly Gly Ala Tyr Thr Ser Arg Asn Arg Gly Tyr Asn Glu 
1075 1080 1085 

40 Ala. Pro Ser Val Pro Ala Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser 
1090 1095 1100 

Tyr Thr Asp Gly Arg Arg Glu Asn Pro Cys Glu Phe Asn Arg Gly Tyr 
1105 1110 1115 1120 

Arg Asp Tyr Thr Pro Leu Pro Val Gly Tyr Val Thr Lys Glu Leu Glu 
45 1125 1130 1135 

Tyr Phe Pro Glu Thr Asp Lys Val Trp He Glu He Gly Glu Thr Glu 

1140 1145 1150 

Gly Thr Phe He Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
1155 1160 1165 

SO 



<210> 41 
<211> 2133 
55 <212> DMA 

<213> Bacillus 



thuringiensis 
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<220> 

<221> CDS 

<222> (1)..(2133) 



<400> 41 ^ „ AO 

atg aaa tct aag aat caa aat atg cat caa age ttg tct aac aat gcg 48 
Met Lys Ser Lys Asn Gin Asn Met His Gin Ser Leu Ser Asn Asn Ala 



1 5 



10 15 



aca gtt gat aaa aac ttt aca ggt tea eta gaa aat aac aca aat acg 
Thr val Asp Lys Asn Phe Thr Gly Ser Leu Glu Asn Asn Thr Asn Thr 

25 30 



20 



gaa tta caa aac ttt aat cat gaa ggt ata gag ccg ttt gtt agt gta 
llu Leu Gin Asn Phe Asn His Glu Gly He Glu Pro Phe Val Ser Val 

40 45 I 



35 



tea aca att caa acg ggt att ggt att get ggt aaa ate ett ggt aac 
Ser Thr He Gin Thr Gly He Gly He Ala Gly Lys He Leu Gly Asn 
50 55 60 

eta ggc gtt cct ttt get ggg caa gta get age etc tat agt ttt ate 
Leu Gly val Pro Phe Ala Gly Gin Val Ala Ser Leu Tyr Ser Phe He 

75 °" 



65 70 



eta ggt gag ctt tgg ccc aaa ggg aaa age caa tgg gaa ate ttt atg 
Leu G?y Glu Leu Trp Pro Lys Gly Lys Ser Gin Trp Glu He Phe Met 

90 95 



85 



gaa cat gta gaa gag ctt att aat caa aag ata teg act tat gca aga 
Glu His val Glu Glu Leu He Asn Gin Lys He Ser Thr Tyr Ala Arg 
100 105 HO 

aac aaa gca ctt gca gat tta aaa gga tta gga gat get ttg get gtc 
Asn Lys Ala Leu Ala Asp Leu Lys Gly Leu Gly Asp Ala Leu Ala Val 
115 120 125 

tac cat gaa teg ctg gaa agt tgg att gaa aat cge aat aac aca aga 
Tvr His Glu ser Leu Glu Ser Trp He Glu Asn Arg Asn Asn Thr Arg 
130 135 140 

acc aga agt gtt gtc aag age caa tac ate ace ttg gaa ctt atg tte 
Thr Arg Ser Val Val Lys Ser Gin. Tyr He Thr Leu Glu Leu Met Phe 
145 150 155 160 

qta caa tea tta cct tct ttt gca gtg tct gga gag gaa gta cea eta 
val Gin ser Leu Pro Ser Phe Ala Val Ser Gly Glu Glu Val Pro Leu 
165 170 175 

tta cea ata tat get caa get gca aat tta cac tta ttg eta tta cga 
Leu Pro He Tyr Ala Gin Ala Ala Asn Leu His Leu Leu Leu Leu Arg 
180 185 190 

qat get tct att ttt gga aaa taa tgg ggg tta tea gac tea gaa att 
ASP Ala ser He Phe Gly Lys Xaa Trp Gly Leu Ser Asp Ser Glu He 
195 200 205 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 
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tcc aca ttt tat aat cgc caa tec gga aaa teg aaa gaa tat tct gac 672 

Ser Thr Phe Tyr Asn Arg Gin Ser Gly Lys Ser Lys Glu Tyr Ser Asp 

210 215 220 

5 cac tgc gta aaa tgg tat aat aca ggc eta aat cgc ttg atg ggg aac 720 
His Cys Val Lys Trp Tyr Asn Thr Gly Leu Asn Arg Leu Met Gly Asn 
225 230 235 240 

aat gcc gaa agt tgg gta cga tat aat caa ttc cgt aga gac atg act 768 
10 Asn Ala Glu Ser Trp Val Arg Tyr Asn Gin Phe Arg Arg Asp Met Thr 
245 250 255 

tta atg gta eta gat tta gtg gca eta ttt cca age tat gat aca caa 816 
Leu Met Val Leu Asp Leu Val Ala Leu Phe Pro Ser Tyr Asp Thr Gin 
IS 260 265 ' 270 

atg tat cca att aaa act aca gcc caa ctt aca aga gaa gta tat aca 864 

Met Tyr Pro He Lys Thr Thr Ala Gin Leu Thr Arg Glu Val Tyr Thr 

275 280 285 

20 

gac gca att ggg aca gta cat ccg cat cca agt ttt aca agt aeg act 912 

Asp Ala He Gly Thr Val His Pro His Pro Ser Phe Thr Ser Thr Thr 

290 295 300 

25 tgg tat aat aat aat gca cct teg ttc tct acc ata gag get get gtt 960 
Trp Tyr Asn Asn Asn Ala Pro Ser Phe Ser Thr lie Glu Ala Ala Val 
305 310 315 320 

gtt cga aac ccg cat eta etc gat ttt eta gaa caa gtt aca att tac 1008 
30 Val Arg Asn Pro His Leu Leu Asp Phe Leu Glu Gin Val Thr He Tyr 
325 330 335 

age tta tta agt ega tgg agt aac act cag tat atg aat atg tgg gga 1056 
Ser Leu Leu Ser Arg Trp Ser Asn Thr Gin Tyr Met Asn Met Trp Gly 
3S 340 345 350 

gga cat aaa eta gaa ttc ega aca ata gga gga aeg tta aat acc tea 1104 
Gly His Lys Leu Glu Phe Arg Thr lie Gly Gly Thr Leu Asn Thr Ser 
355 360 365 

40 

aca caa gga tct act aat act tct att aat cct gta aca tta ccg ttc 1152 
Thr Gin Gly Ser Thr Asn Thr Ser He Asn Pro Val Thr Leu Pro Phe 
370 375 380 

45 act tct cga gac gtc tat agg act gaa tea ttg gca ggg ctg aat eta 1200 
Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Leu Ala Gly Leu Asn Leu 
385 390 395 400 

ttt tta act caa cct gtt aat gga gta cct agg gtt gat ttt cat tgg 1248 
50 Phe Leu Thr Gin Pro Val Asn Gly Val Pro Arg Val Asp Phe His Trp 
405 410 415 

aaa ttc gtc aca cat ccg ate gca tct gat aat ttc tat tat cca ggg 1296 
Lys Phe Val Thr His Pro He Ala Ser Asp Asn Phe Tyr Tyr Pro Gly 
55 420 425 430 

tat get gga att ggg aeg caa tta cag gat tea gaa aat gaa tta cca 1344 
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1392 



1440 
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Tyr Ala Gly He 61y Thr Gin Leu Gin Asp Ser Glu Asn Glu Leu Pro 
435 440 

cct gaa gca aca gga cag cca aat tat gaa tct tat agt cat aga tta 
5 Pro Glu Ala Thr Gly Gin Pro Asn Tyr Glu Ser Tyr Ser His Arg Leu 
450 455 460 

tct cat ata gga etc att tea gca tea cat gtg aaa gca ttg gta tat 
ser Til lie liy Leu He Ser Ala Ser His Val Lys Ala Leu Val Tyr 
10 465 470 475 480 

tct tag acg cat cgt agt gca gat cgt aca aat aca att gag cca aat 
ser tS Th? His A?g Ser Ala Asp Arg Thr Asn Thr He Glu Pro Asn 
485 490 495 

" age att aca caa ata cca tta gta aaa gcg ttc aat etg tc^ tea ggt 1536 
sir He Thr Gin He Pro Leu Val Lys Ala Phe Asn Leu Ser Ser Gly 
500 505 510 

qcc get gta gtg aga gga cca gga ttt aca ggt ggg gat ate ctt cga 1584 
fla III ?al Vai Arg Gly Pro Gly Phe Thr Gly Gly Asp He Leu Arg 
515 520 525 

aga aag aat act ggt aca ttt ggg gat ata ega gta aat att aat cca 
A?g lys Asn Thr Giy Thr Phe Gly Asp He Arg Val Asn He Asn Pro 
530 535 540 

cca ttt gca caa aga tat cgc gtg agg att cgc tat get tct ace aca 
pro Phe Gin A?g Tyr Arg Val Arg He Arg Tyr Ala Ser Thr Thr 
30 545 550 555 560 

aat tta caa ttc cat acg tea att aac ggt aaa get att aat caa ggt 
Isp Leu G^n Phe His Th? Ser He Asn Gly Lys Ala Ha Asn Gin Gly 
565 570 575 

" aat ttt tea gca act atg aat aga gga gag gac tta gac tat aaa acc 1776 
Asn Phe ser Ala Thr Met Asn Arg Gly Glu Asp Leu Asp Tyr Lys Thr 
580 585 590 



1632 



1680 



1728 



t-M- aaa act Qta gge ttt aec ace cca ttt age ttt tea gat gta caa 
Phe Sg ?hr Tal % Phe Thr Thr Pro Phe Ser Phe Ser Asp Val Gin 

600 ^05 



595 



agt aca ttc aca ata ggt get tgg aac ttc tct tea ggt aae gaa gtt 
45 sir Thr Phe Thr He Gly Ala Trp Asn Phe Ser Ser Gly Asn Glu Val 
610 «15 620 

tat ata gat aga att gaa ttt gtt ccg gta gaa gta aca tat gag gca 
Tyr He Lp A^g He Glu Phe Val Pro Val Glu Val Thr Tyr Glu Ala 
SO 6^5 "0 "° 



aaa tat gat ttt gaa aaa gcg caa gag gag gtt act gca etg ttt aca 
Til ryl Lp Phe Glu Lys Ala Gin Glu Glu Val Thr Ala Leu Phe Thr 
645 650 655 



1824 



1872 



1920 



1968 



2016 
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660 665 670 

gac cag gta tea aat tta gta gag tct eta tea gat aaa ttc tat ctt 2064 
Asp Gin Val Ser Asn Leu Val Glu Ser Leu Ser Asp Lys Phe Tyr Leu 
675 680 685 

gat gaa aag aga gaa tta ttc gag ata gtt aaa tae gcg aag caa etc 2112 
Asp Glu Lys Arg Glu Leu Phe Glu He Val Lys Tyr Ala Lys Gin Leu 
690 695 700 



cat att gag cgt aac atg tag 
His He Glu Arg Asn Met 
705 710 



<210> 42 
<211> 710 
<212> PRT 

<213> Bacillus thuringiensis 



<400> 42 

Met Lys Ser Lys Asn Gin Asn Met His Gin Ser Leu Ser Asn Asn Ala 

15 10 15 

Thr Val Asp Lys Asn Phe Thr Gly Ser Leu Glu Asn Asn Thr Asn Thr 
25 20 25 30 

Glu Leu Gin Asn Phe Asn His Glu Gly He Glu Pro Phe Val Ser Val 

35 40 45 

Ser Thr He Gin Thr Gly He Gly He Ala Gly Lys He Leu Gly Asn 

50 55 60 

Leu Gly Val Pro Phe Ala Gly Gin Val Ala Ser Leu Tyr Ser Phe He 
65 70 75 80 

Leu Gly Glu Leu Trp Pro Lys Gly Lys Ser Gin Trp Glu He Phe Met 

85 90 95 

Glu His Val Glu Glu Leu He Asn Gin Lys He Ser Thr Tyr Ala Arg 
35 100 105 110 

Asn Lys Ala Leu Ala Asp Leu Lys Gly Leu Gly Asp Ala Leu Ala Val 

115 120 125 

Tyr His Glu Ser Leu Glu Ser Trp lie Glu Asn Arg Asn Asn Thr Arg 
130 135 140 

40 Thr Arg Ser Val Val Lys Ser Gin Tyr He Thr Leu Glu Leu Met Phe 
145 150 155 160 

Val Gin Ser Leu Pro Ser Phe Ala Val Ser Gly Glu Glu Val Pro Leu 

165 170 175 

Leu Pro He Tyr Ala Gin Ala Ala Asn Leu His Leu Leu Leu Leu Arg 
45 1 80 1 85 190 

Asp Ala Ser He Phe Gly Lys Xaa Trp Gly Leu Ser Asp Ser Glu He 

195 200 205 

Ser Thr Phe Tyr Asn Arg Gin Ser Gly Lys Ser Lys Glu Tyr Ser Asp 

210 215 220 

His Cys Val Lys Trp Tyr Asn Thr Gly Leu Asn Arg Leu Met Gly Asn 
225 230 235 240 

Asn Ala Glu Ser Trp Val Arg Tyr Asn Gin Phe Arg Arg Asp Met Thr 

245 250 255 

Leu Met Val Leu Asp Leu Val Ala Leu Phe Pro Ser Tyr Asp Thr Gin 
55 260 265 270 

Met Tyr Pro He Lys Thr Thr Ala Gin Leu Thr Arg Glu Val Tyr Thr 
275 280 285 



50 



2133 
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Asp Ala He Gly Thr Val His Pro His Pro Ser Phe Thr Ser Thr Thr 

290 295 300 

Trp Tyr Asn Asn Asn Ala Pro Ser Phe Ser Thr He Glu Ala Ala Val 
305 310 315 320 

5 val Arg Asn Pro His Leu Leu Asp Phe Leu Glu Gin Val Thr lie Tyr 
325 330 335 

ser Leu Leu Ser Arg Trp Ser Asn Thr Gin Tyr Met Asn Met Trp Gly 

340 345 350 

Gly His Lys Leu Glu Phe Arg Thr He Gly Gly Thr Leu Asn Thr Ser 
10 355 360 365 

Thr Gin Gly Ser Thr Asn Thr Ser He Asn Pro Val Thr Leu Pro Phe 

370 375 380 

Thr ser Arg Asp Val Tyr Arg Thr Glu Ser Leu Ala Gly Leu Asn Leu 
385 390 395 400 

,5 Phe Leu Thr Gin Pro Val Asn Gly Val Pro Arg Val Asp Phe His Trp 
405 410 I 415 

LYS Phe val Thr His Pro He Ala Ser Asp Asn Phe Tyr Tyr Pro Gly 

420 425 430 

Tyr Ala Gly He Gly Thr Gin Leu Gin Asp Ser Glu Asn Glu Leu Pro 

20 435 ^-^O 

Pro Glu Ala Thr Gly Gin Pro Asn Tyr Glu Ser Tyr Ser His Arg Leu 

450 455 460 

ser His He Gly Leu He Ser Ala Ser His Val Lys Ala Leu Val Tyr 
465 470 475 480 

25 ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn Thr He Glu Pro Asn 
485 490 495 

Ser He Thr Gin He Pro Leu Val Lys Ala Phe Asn Leu Ser Ser Gly 

500 505 510 

Ala Ala val Val Arg Gly Pro Gly Phe Thr Gly Gly Asp He Leu Arg 
in 515 520 525 

Arg Lys Asn Thr Gly Thr Phe Gly Asp He Arg Val Asn He Asn Pro 

Pro III Ala Gin Arg Tyr S Val Arg He Arg Tyr Ala Ser Thr Thr 
545 550 555 560 

35 ASP Leu Gin Phe His Thr Ser He Asn Gly Lys Ala He Asn Gin Gly ^ 

565 570 575 H 

Asn Phe Ser Ala Thr Met Asn Arg Gly Glu Asp Leu Asp Tyr Lys Thr 

580 585 590 

Phe Arg Thr Val Gly Phe Thr Thr Pro Phe Ser Phe Ser Asp Val Gin 
40 5 9 5 605 

ser Thr Phe Thr He Gly Ala Trp Asn Phe Ser Ser Gly Asn Glu Val 

615 

Tyr He Asp Arg He Glu Phe Val Pro Val Glu Val Thr Tyr Glu Ala 

45 Glu Tyr ASP Phe Glu Lys Ala Gin Glu Glu Val Thr Ala Leu Phe Thr 
645 650 655 

ser Thr Asn Pro Arg Gly Leu Lys Thr Asp Val Lys Asp Tyr Hxs He 

660 665 670 

ASP Gin val Ser Asn Leu Val Glu Ser Leu Ser Asp Lys Phe Tyr Leu 
50 675 680 685 

ASP Glu Lys Arg Glu Leu Phe Glu He Val Lys Tyr Ala Lys Gin Leu 

690 695 700 

His He Glu Arg Asn Met 



705 



710 



55 
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<210> 43 
<211> 218 
<212> DNA 

<213> Bacillus thuringiensis 

5 

<400> 43 

gtagccgatt tcccgaaaac catagataga 
acggcaggtt ttactacccc ttttagtttt 
gttcagagtg tttcttcagg taacgagatt 
10 gatgcaacct ttgaggcaga atatgattta 



ggggaaaact tagaatatgg aagctttaga 60 
gtaagttcaa caaataattt cacattaggt 120 
tttgtagatc gaattgaatt tgttccggca 180 
gaaagagc 218 



<210> 44 
<211> 72 
15 <212> PRT 

<213> Bacillus thuringiensis 

I 

, <400> 44 

Val Ala Asp Phe Pro Lys Thr lie Asp Arg Gly Glu Asn Leu Glu Tyr 
20 1 5 10 15 

Gly Ser Phe Arg Thr Ala Gly Phe Thr Thr Pro Phe Ser Phe Val Ser 
20 25 30 

25 Ser Thr Asn Asn Phe Thr Leu Gly Val Gin Ser Val Ser Ser Gly Asn 
35 40 45 

Glu lie Phe Val Asp Arg lie Glu Phe Val Pro Ala Asp Ala Thr Phe 
50 55 60 



30 



Glu Ala Glu Tyr Asp Leu Glu Arg 
65 70 



35 <210> 45 

<211> 1908 
<212> DNA 

<213> Bacillus thuringiensis 

40 <220> 

<221> CDS 

<222> <1} . . (1908) 

<400> 45 

45 atg aat aat gta ttg aat age gga aaa aca act att tgt aat gcg tat 48 
Met Asn Asn Val Leu Asn Ser Gly Lys Thr Thr lie Cys Asn Ala Tyr 
1 5 10 15 

aat gta gtg get cac gat cca ttt agt ttt gaa cat aaa tea tta gat 96 
50 Asn Val Val Ala His Asp Pro Phe Ser Phe Glu His Lys Ser Leu Asp 
20 25 30 

acc ate caa gaa gaa tgg atg gag tgg aaa aga aca gat cat agt tta 144 
Thr lie Gin Glu Glu Trp Met Glu Trp Lys Arg Thr Asp His Ser Leu 
ss 35 40 45 

tat gta get cet gta gtc gga act gtg tet agt ttt ctg eta aag aaa 192 
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Tvr val Ala Pro Val Val Gly Thr Val Ser Ser Phe Leu Leu Lys Lys 
50 55 60 

qtg ggg agt eta att gga aaa agg ata ttg agt gaa tta tgg ggg tta 
5 val Gly Ser Leu He Gly Lys Arg He Leu Ser Glu Leu Trp Gly Leu 
65 70 75 80 

ata ttt cct agt ggt agt aca aat eta atg caa gat att tta aga gag 
He Phe Pro Ser Gly Ser Thr Asn Leu Met Gin Asp He Leu Arg Glu 
,0 B5 90 95 

aca gaa caa ttc eta aat caa aga ctt aat aca gac acc ctt gat cgt 



240 



288 



336 



IS 



Thr Glu Gin Phe Leu Asn Gin Arg Leu Asn Thr Asp Thr Leu Asp Arg 
100 105 110 

gta aat gca gaa ttg gaa ggg etc caa gcg aat ata agg gag ttt aat 384 
val Asn Ala Glu Leu Glu Gly Leu, Gin Ala Asn He Arg Glu Phe Asn 
115 120 125 



20 



caa caa gta gat aat ttt tta aac cct act caa aac cct gtt cct tta 
Gin Gin val Asp Asn Phe Leu Asn Pro Thr Gin Asn Pro Val Pro Leu 
130 135 140 

tea ata act tct tea gtt aat aca atg cag caa tta ttt eta aat aga 
25 Ser He Thr Ser Ser Val Asn Thr Met Gin Gin Leu Phe Leu Asn Arg 



145 150 



155 ISO 



tta ccc cag ttc cag ata caa gga tac cag ttg tta tta tta cct tta 
Leu Pro Gin Phe Gin He Gin Gly Tyr Gin Leu Leu Leu Leu Pro Leu 

170 175 



30 165 



ttt gca cag gca gee aat atg cat ctt tct ttt att aga gat gtt att 
Phe Ala Gin Ala Ala Asn Met His Leu Ser Phe He Arg Asp Val He 

185 190 



432 



480 



528 



576 



180 

" Ctt aat gca gat gaa tgg gge att tea gca gca aca eta cgt acg tat 624 ^ 
Leu Asn Ala Asp Glu Trp Gly He Ser Ala Ala Thr Leu Arg Thr Tyr 

205 



195 200 

40 

Arg 

210 



cga gac tac ctg aga aat tat aca aga gat tat tct aat tat tgt ata 
Arg Lp Tyr Leu Arg Asn Tyr Thr Arg Asp Tyr Ser Asn Tyr Cys He 

215 220 



aat acg tat caa act gcg ttt aga ggg tta aac acc cgt tta cac gat 
45 Asn Thr Tyr Gin Thr Ala Phe Arg Gly Leu Asn Thr Arg Leu His Asp 
225 230 235 240 



atg tta gaa ttt aga aca tat atg ttt tta aat gta ttt gaa tat gta 
Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr Val 



so 



245 



250 255 



672 



720 



768 



816 



tec att tgg tea ttg ttt aaa tat cag agt ctt atg gta tct tct ggc 
ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Met Val Ser Ser Gly 
260 265 270 

" get aat tta tat get agt ggt agt gga cca cag cag aca caa tea ttt 864 
Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Gin Gin Thr Gin Ser Phe 
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275 280 285 

act gca caa aac tgg cca ttt tta tat tct ctt ttc caa gtt aat teg 912 
Thr Ala Gin Asn Trp Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn Ser 
5 290 295 300 

aat tat ata tta tct ggt att agt ggt aat agg ctt tct act acc ttc 960 

Asn Tyr lie Leu Ser Gly He Ser Gly Asn Arg Leu Ser Thr Thr Phe 

305 310 315 320 

10 

cct aat att ggt ggt tta ccg ggt agt act aca att cat tea ttg aac 1008 

Pro Asn He Gly Gly Leu Pro Giy Ser Thr Thr He His Ser Leu Asn 
325 330 335 

15 agt gcc agg gtt aat tat age gga gga gtt tea tet ggt etc ata ggg 1056 
Ser Ala Arg Val Asn Tyr Ser Gly Gly Val Ser Ser Gly Leu lie Gly 
340 345 350 

gcg act aat etc aat cac aac ttt aat tgc age acg gtc etc cct cct 1104 
20 Ala Thr Asn Leu Asn His Asn Phe Asn Cys Ser Thr Val Leu Pro Pro 
355 360 365 

tta tea aca eea ttt gtt aga agt tgg etg gat tea ggt aca gat cga 1152 
Leu Ser Thr Pro Phe Val Arg Ser Trp Leu Asp Ser Gly Thr Asp Arg 
2S 370 375 380 

gag gge gtt get aec tct acg act tgg eag aca gaa tec tte caa ata 1200 
Glu Gly Val Ala Thr Ser Thr Thr Trp Gin Thr Glu Ser Phe Gin He 
385 390 395 400 

30 

act tea ggt tta agg tgt ggt get ttt eet ttt tea get egt gga aat 1248 
Thr Ser Gly Leu Arg Cys Gly Ala Phe Pro Phe Ser Ala Arg Gly Asn 
405 410 415 

35 tea aac tat tte cca gat tat ttt ate egt aat att tet ggg gtt cot 1296 
Ser Asn Tyr Phe Pro Asp Tyr Phe He Arg Asn He Ser Gly Val Pro 
420 425 430 

tta gtt att aga aac gaa gat eta aca aga ccg tta cac tat aac caa 1344 
40 Leu Val He Arg Asn Glu Asp Leu Thr Arg Pro Leu His Tyr Asn Gin 
435 440 445 

ata aga aat ata gaa agt cct teg gga aca cct ggt gga tta cga get 1392 
He Arg Asn He Glu Ser Pro Ser Gly Thr Pro Gly Gly Leu Arg Ala 
45 450 455 460 

tat atg gta tct gtg cat aac aga aaa aat aat ate tat gcc get cat 144 0 

Tyr Met Val Ser Val His Asn Arg Lys Asn Asn He Tyr Ala Ala His 

465 470 475 480 

50 

gaa aat ggt act atg att cat ttg gca ccg gaa gat tat aca gga ttt 1488 

Glu Asn Gly Thr Met He His Leu Ala Pro Glu Asp Tyr Thr Gly Phe 
485 490 495 

55 act ata tea cca ata cat gcc act caa gtg aat aat caa act cga aca 1536 
Thr He Ser Pro He His Ala Thr Gin Val Asn Asn Gin Thr Arg Thr 
500 505 510 
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60 



10 



IS 



20 



40 



SO 



S5 



Met 


Asn 


Asn 


Val 


Leu 


Asn 


Ser 


Gly Lys 


Thr 


Thr He 


Cys 


1 








5 










10 


Glu His 




Asn 


Val 


Val 


Ala 


His 


Asp 


Pro 


Phe 


Ser 


Phe 


Lys 






20 










25 








Thr 


He 


Gin 


Glu 


Glu 


Trp 


Met 


Glu 


Trp 


Lys 


Arg Thr 


Asp 




35 










40 








45 


Tyr 


val 

50 


Ala 


Pro 


Val 


Val 


Gly 
55 


Thr 


Val 


Ser 


Ser Phe 
60 


Leu 


Val 


Gly 


Ser 


Leu 


He 


Gly 


Lys 


Arg 


lie 


Leu 


Ser Glu 


Leu 


65 








70 










75 


He 


lie 


Phe 


Pro 


Ser 


Gly 


Ser 


Thr 


Asn 


Leu 


Met 


Gin Asp 








85 










90 






Thr 


Glu 


Gin 


Phe 


Leu 


Asn 


Gin 


Arg 


Leu 


Asn 


Thr Asp 


Thr 






100 










105 








Val 


Asn 


Ala 


Glu 


Leu 


Glu 


Gly 


Leu 


Gin 


Ala 


Asn He 


Arg 




115 










120 








125 


Gin 


Gin 


Val 


Asp 


Asn 


Phe 


Leu 


Asn 


Pro 


Thr 


Gin Asn 


Pro 



IS 

;er Leu 
30 



80 

Leu Arg Glu 
95 

Leu Asp Arg 
110 



1584 



1632 



1680 



1728 



ttt att tct gaa aaa ttt gga aat caa ggt gat tec tta aga ttt gaa 
Phe He Ser Glu Lys Phe Gly Asn Gin Gly Asp Ser Leu Arg Phe Glu 
515 520 525 

caa agt aac acg aca get cgt tat acg ctt aga ggg aat gga aat agt 
Gin Ser Asn Thr Thr Ala Arg Tyr Thr Leu Arg Gly Asn Gly Asn Ser 
530 535 540 

tac aat ctt tat tta aga gta tct tea ata gga aat tea act ate cga 
Tvr Asn Leu Tyr Leu Arg Val Ser Ser He Gly Asn Ser Thr He Arg 
545 550 555 560 

gtt act ata aac ggt agg gtt tat act get tea aat gtt aat act aat 
Val Thr He Asn Gly Arg Val Tyr Thr Ala Ser Asn Val Asn Thr Asn 
565 570 I 575 

aca aat aac gat ggg gtt aat gat aat gga get cgt ttt tea gat att 1776 
Thr Asn Asn Asp Gly Val Asn Asp Asn Gly Ala Arg Phe Ser Asp He 
580 585 590 

aat ate ggt aat gta gta gca agt gat aat act aat gta ccg tta gat 1824 
Asn He Gly Asn Val Val Ala Ser Asp Asn Thr Asn Val Pro Leu Asp 
595 600 605 

" ata aat gtg aca tta aac tec ggt act caa ttt gag ctt atg aat att 1872 
He Asn Val Thr Leu Asn Ser Gly Thr Gin Phe Glu Leu Met Asn He 
610 615 620 

30 atg ttt gtg cca act aat ctt eca cca ctt tat taa 
Met Phe Val Pro Thr Asn Leu Pro Pro Leu Tyr 
625 630 635 



35 <210> 46 
<211> 635 
<212> PRT 

<213> Bacillus thuringiensis 



1908 
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130 135 140 

Ser lie Thr Ser Ser Val Asn Thr Met Gin Gin Leu Phe Leu Asn Arg 
145 150 155 160 

Leu Pro Gin Phe Gin lie Gin Gly Tyr Gin Leu Leu Leu Leu Pro Leu 
' 5 165 170 175 

Phe Ala Gin Ala Ala Asn Met His Leu Ser Phe He Arg Asp Val He 

180 185 190 

Leu Asn Ala Asp Glu Trp Gly He Ser Ala Ala Thr Leu Arg Thr Tyr 
195 200 205 

10 Arg Asp Tyr Leu Arg Asn Tyr Thr Arg Asp Tyr Ser Asn Tyr Cys He 
210 215 220 

Asn Thr Tyr Gin Thr Ala Phe Arg Gly Leu Asn Thr Arg Leu His Asp 
225 230 235 240 

Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr Val 
15 245 250 255 

Ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Met Val Ser Ser Gly 

260 265 270 

Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Gin Gin Thr Gin Ser Phe 
275 280 285 

20 Thr Ala Gin Asn Trp Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn Ser 
290 295 300 

Asn Tyr He Leu Ser Gly He Ser Gly Asn Arg Leu Ser Thr Thr Phe 
305 310 315 320 

Pro Asn He Gly Gly Leu Pro Gly Ser Thr Thr He His Ser Leu Asn 
25 325 330 335 

Ser Ala Arg Val Asn Tyr Ser Gly Gly Val Ser Ser Gly Leu He Gly 

340 345 350 

Ala Thr Asn Leu Asn His Asn Phe Asn Cys Ser Thr Val Leu Pro Pro 
355 360 365 

30 Leu Ser Thr Pro Phe Val Arg Ser Trp Leu Asp Ser Gly Thr Asp Arg 
370 375 380 

Glu Gly Val Ala Thr Ser Thr Thr Trp Gin Thr Glu Ser Phe Gin He 
385 390 395 400 

Thr Ser Gly Leu Arg Cys Gly Ala Phe Pro Phe Ser Ala Arg Gly Asn 
35 405 410 415 

Ser Asn Tyr Phe Pro Asp Tyr Phe He Arg Asn lie Ser Gly Val Pro 

420 425 430 

Leu Val He Arg Asn Glu Asp Leu Thr Arg Pro Leu His Tyr Asn Gin 
435 440 445 

40 He Arg Asn He Glu Ser Pro Ser Gly Thr Pro Gly Gly Leu Arg Ala 
450 455 460 

Tyr Met Val Ser val His Asn Arg Lys Asn Asn He Tyr Ala Ala His 
465 470 475 480 

Glu Asn Gly Thr Met He His Leu Ala Pro Glu Asp Tyr Thr Gly Phe 
45 485 490 495 

Thr He Ser Pro He His Ala Thr Gin Val Asn Asn Gin Thr Arg Thr 

500 505 510 

Phe He Ser Glu Lys Phe Gly Asn Gin Gly Asp Ser Leu Arg Phe Glu 
515 520 525 

50 Gin Ser Asn Thr Thr Ala Arg Tyr Thr Leu Arg Gly Asn Gly Asn Ser 
530 535 540 

Tyr Asn Leu Tyr Leu Arg Val Ser Ser He Gly Asn Ser Thr He Arg 
545 550 555 560 

Val Thr He Asn Gly Arg Val Tyr Thr Ala Ser Asn Val Asn Thr Asn 
55 565 570 575 

Thr Asn Asn Asp Gly Val Asn Asp Asn Gly Ala Arg Phe Ser Asp He 
580 585 590 
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Asn lie Gly Asn Val Val Ala Ser Asp Asn Thr Asn Val Pro Leu Asp 

595 600 605 

He Asn val Thr Leu Asn Ser Gly Thr Gin Phe Glu Leu Met Asn He 
610 615 620 

5 Met Phe Val Pro Thr Asn Leu Pro Pro Leu Tyr 
625 630 635 



10 <210> 47 

<211> 1878 

<212> DNA 

<213> Bacillus thuringiensis 

15 <220> 

<221> CDS 

.<222> (1) . . (1878) 



20 



25 



40 



45 



<400> 47 ^ . 

atg aat act gta ttg aat aac gga aga aat act act tgt cat gca cat 

Met Asn Thr Val Leu Asn Asn Gly Arg Asn Thr Thr Cys His Ala His 

1 5 10 15 

aat gta gtt get cat gat cca ttt agt ttt gaa cat aaa tea tta aat 

Asn Val Val Ala His Asp Pro Phe Ser Phe Glu His Lys Ser Leu Asn 



20 



25 30 



acc ata gaa aaa gaa tgg aaa gaa tgg aaa aga act gat cat agt tta 

Thr He Glu Lys Glu Trp Lys Glu Trp Lys Arg Thr Asp Has Ser Leu 

30 35 40 45 

tat gta gcc cct att gtg gga act gtg ggt agt ttt eta tta aag aaa 

Tyr val Ala Pro lie Val Gly Thr Val Gly Ser Phe Leu Leu Lys Lys 



50 



55 60 



48 



96 



144 



192 



gta ggg agt ctt gtt gga aaa agg ata ctg agt gag tta cag aat tta 240 ^ 
val Gly Ser Leu Val Gly Lys Arg He Leu Ser Glu Leu Gin Asn Leu 

75 80 



65 70 



att ttt cct agt ggt agt ata gat tta atg caa gag att tta aga gcg 
He Phe Pro Ser Gly Ser He Asp Leu Met Gin Glu He Leu Arg Ala 
85 90 

aca gaa caa ttc ata aat caa agg ctt aat gca gac acc ctt ggt cgt 
Thr Glu Gin Phe He Asn Gin Arg Leu Asn Ala Asp Thr Leu Gly Arg 



100 



105 110 



gta aat gca gaa ttg gca ggt ctt caa gcg aat gtg gca gag ttt aat 
val Asn Ala Glu Leu Ala Gly Leu Gin Ala Asn Val Ala Glu Phe Asn 



50 115 



120 125 



288 



336 



384 



432 



cga caa gta gat aat ttt tta aac cct aat caa aac cct gtt cct tta 
Arq Gin Val Asp Asn Phe Leu Asn Pro Asn Gin Asn Pro Val Pro Leu 
130 135 140 

" gca ata att gat tea gtt aat aca ttg cag caa tta ttt eta agt aga 480 
Ala He He Asp Ser Val Asn Thr Leu Gin Gin Leu Phe Leu Ser Arg 
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145 150 155 160 

tta cca cag ttc cag ata caa ggc tat caa ctg tta tta tta cct tta 528 
Leu Pro Gin Phe Gin lie Gin Gly Tyr Gin Leu Leu Leu Leu Pro Leu 
5 165 170 175 

ttt gca cag gca gcc aat tta cat ctt tct ttt att aga gat gtc ate 576 
Phe Ala Gin Ala Ala Asn Leu His Leu Ser Phe He Arg Asp Val He 
180 185 190 

10 

ctt aat gca gat gaa tgg ggc att tea gca gca aca gta cgc aca tat 624 
Leu Asn Ala Asp Glu Trp Gly He Ser Ala Ala Thr Val Arg Thr Tyr 
195 200 205 

IS aga gat cac ctg aga aat ttc aca aga gat tac tct aat tat tgt ata 672 
Arg Asp His Leu Arg Asn Phe Thr Arg Asp Tyr Ser Asn Tyr Cys He 
210 215 220 

aat acg tat caa act gca ttt aga ggt tta aac act cgt tta cac gat 720 
20 Asn Thr Tyr Gin Thr Ala Phe Arg Gly Leu Asn Thr Arg Leu His Asp 
225 230 235 240 

atg tta gaa ttt aga aca tat atg ttt tta aat gta ttt gaa tat gtc 768 
Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr Val 
25 245 250 255 

tct ate tgg teg tta ttt aaa tat caa age ctt eta gta tct tec ggc 816 

Ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Leu Val Ser Ser Gly 

260 265 270 

30 

get aat tta tat gcg agt ggt agt ggt cca aca caa tea ttt aca gca 864 

Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Thr Gin Ser Phe Thr Ala 
275 280 285 

35 caa aac tgg cca ttt tta tat tct ctt ttc caa gtt aat tct aat tat 912 
Gin Asn Trp Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn Ser Asn Tyr 
290 295 300 

gta tta aat ggt ttg agt ggt get agg acc acc att act ttc cct aat 960 
40 Val Leu Asn Gly Leu Ser Gly Ala Arg Thr Thr He Thr Phe Pro Asn 
305 310 315 320 

att ggt ggt ctt ccc ggt tct acc aca act caa aca ttg cat ttt gcg 1008 
He Gly Gly Leu Pro Gly Ser Thr Thr Thr Gin Thr Leu His Phe Ala 
45 325 330 335 

agg att aat tat aga ggt gga gtg tea tct age cgc ata ggt caa get 1056 

Arg He Asn Tyr Arg Gly Gly Val Ser Ser Ser Arg He Gly Gin Ala 
340 345 350 

50 

aat ctt aat caa aac ttt aac att tec aca ctt ttc aat cct tta caa 1104 

Asn Leu Asn Gin Asn Phe Asn He Ser Thr Leu Phe Asn Pro Leu Gin 
355 360 365 

55 aca ccg ttt att aga agt tgg eta gat tct ggt aca gat egg gag ggc 1152 
Thr Pro Phe He Arg Ser Trp Leu Asp Ser Gly Thr Asp Arg Glu Gly 
370 375 380 
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gtt gcc acc tct aca aac tgg caa tea gga gcc ttt gag aca act tta 
Val Ala Thr Ser Thr Asn Trp Gin Ser Gly Ala Phe Glu Thr Thr Leu 
385 390 395 400 

5 

tta cga ttt age att ttt tea get egt ggt aat teg aac ttt tte cea 
Leu Arg Phe Ser lie Phe Ser Ala Arg Gly Asn Ser Asn Phe Phe Pro 
405 410 415 

10 gat tat ttt ate cgt aat att tct ggt gtt gtt ggg act att age aac 
Asp Tyr Phe lie Arg Asn lie Ser Gly Val Val Gly Thr He Ser Asn 
420 425 430 



50 ttt tea gat att aat ate ggt aat gta gtg gca agt get aat act aat 
Phe Ser Asp He Asn He Gly Asn Val Val Ala Ser Ala Asn Thr Asn 
580 585 590 

gta cea tta gat ata caa gtg aca ttt aac gac aat cea caa ttt gag 
55 val Pro Leu Asp He Gin Val Thr Phe Asn Asp Asn Pro Gin Phe Glu 
595 600 605 



1200 



1248 



1296 



gca gat tta gca aga cct eta cac ttt aat gaa ata aga gat ata gga 1344 
15 Ala Asp Leu Ala Arg Pro Leu His Phe Asn Glu He Arg Asp He Gly 
435 440 445 ' 

aeg aca gca gtc get age ctt gta aca gtg cat aac aga aaa aat aat 1392 
Thr Thr Ala Val Ala Ser Leu Val Thr Val His Asn Arg Lys Asn Asn 
20 4 5 0 4 5 5 4 6 0 

ate tat gac act cat gaa aat ggt act atg att cat tta gcg cea aat 
He Tyr Asp Thr His Glu Asn Gly Thr Met He His Leu Ala Pro Asn 
465 470 475 480 

gac tat aca gga ttt acc gta tct cea ata cat gcc act caa gta aat 
Asp Tyr Thr Gly Phe Thr Val Ser Pro He His Ala Thr Gin Val Asn 
485 490 495 

30 aat caa att cga acg ttt att tec gaa aaa tat ggt aat cag ggt gat 
Asn Gin He Arg Thr Phe He Ser Glu Lys Tyr Gly Asn Gin Gly Asp 
500 505 510 

tec ttg aga ttt gag eta age aac aca acg get cga tac aca ctt aga 
35 Ser Leu Arg Phe Glu Leu Ser Asn Thr Thr Ala Arg Tyr Thr Leu Arg 
515 520 525 



1440 
1486 
1536 
1584 . 



ggg aat gga aat agt tac aat ctt tat tta aga gta tct tea ata gga 1632 

Gly Asn Gly Asn Ser Tyr Asn Leu Tyr Leu Arg Val Ser Ser He Gly 

40 530 535 540 

agt tec aca att cga gtt act ata aac ggt aga gtt tat act gca aat 

Ser Ser Thr He Arg Val Thr He Asn Gly Arg Val Tyr Thr Ala Asn 

545 550 555 560 

45 ^ . 

gtt aat act acc aca aat aat gat gga gta ctt gat aat gga get cgt 

Val Asn Thr Thr Thr Asn Asn Asp Gly Val Leu Asp Asn Gly Ala Arg 
565 570 575 



1680 



1728 



1776 



1824 
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ctt atg aat att atg ttg ttc caa eta ate ttc cac cac ttt att aag 1872 
Leu Met Asn lie Met Leu Phe Gin Leu lie Phe His His Phe lie Lys 
610 615 620 

gtt tga 1878 

Val 

625 



10 <210> 48 
<211> 625 
<212> PRT 

<213> Bacillus thuringiensis 
15 <400> 48 

Met Asn Thr Val Leu Asn Asn Gly Arg Asn Thr Thr Cys His Ala His 

1 5 10 ' 15 

Asn Val Val Ala His Asp Pro Phe Ser Phe Glu His Lys Ser Leu Asn 
20 25 30 

20 Thr lie Glu Lys Glu Trp Lys Glu Trp Lys Arg Thr Asp His Ser Leu 
35 40 45 

Tyr Val Ala Pro lie Val Gly Thr Val Gly Ser Phe Leu Leu Lys Lys 

50 55 60 

Val Gly Ser Leu Val Gly Lys Arg lie Leu Ser Glu Leu Gin Asn Leu 
25 65 70 75 80 

lie Phe Pro Ser Gly Ser lie Asp Leu Met Gin Glu lie Leu Arg Ala 

85 90 95 

Thr Glu Gin Phe lie Asn Gin Arg Leu Asn Ala Asp Thr Leu Gly Arg 
100 105 110 

30 Val Asn Ala Glu Leu Ala Gly Leu Gin Ala Asn Val Ala Glu Phe Asn 
115 120 125 

Arg Gin Val Asp Asn Phe Leu Asn Pro Asn Gin Asn Pro Val Pro Leu 

130 135 140 

Ala lie He Asp Ser Val Asn Thr Leu Gin Gin Leu Phe Leu Ser Arg 
35 145 150 155 160 

Leu Pro Gin Phe Gin He Gin Gly Tyr Gin Leu Leu Leu Leu Pro Leu 

165 170 175 

Phe Ala Gin Ala Ala Asn Leu His Leu Ser Phe He Arg Asp Val He 
180 185 190 

40 Leu Asn Ala Asp Glu Trp Gly He Ser Ala Ala Thr Val Arg Thr Tyr 
195 200 205 

Arg Asp His Leu Arg Asn Phe Thr Arg Asp Tyr Ser Asn Tyr Cys He 

210 215 220 

Asn Thr Tyr Gin Thr Ala Phe Arg Gly Leu Asn Thr Arg Leu His Asp 
45 225 230 235 240 

Met Leu Glu Phe Arg Thr Tyr Met Phe Leu Asn Val Phe Glu Tyr Val 

245 250 255 

Ser He Trp Ser Leu Phe Lys Tyr Gin Ser Leu Leu Val Ser Ser Gly 
260 265 270 

50 Ala Asn Leu Tyr Ala Ser Gly Ser Gly Pro Thr Gin Ser Phe Thr Ala 
275 280 285 

Gin Asn Trp Pro Phe Leu Tyr Ser Leu Phe Gin Val Asn Ser Asn Tyr 

290 295 300 

Val Leu Asn Gly Leu Ser Gly Ala Arg Thr Thr He Thr Phe Pro Asn 
55 305 310 315 320 

He Gly Gly Leu Pro Gly Ser Thr Thr Thr Gin Thr Leu His Phe Ala 
325 330 335 



BNSDOCID: <W0 0119859A2 



wo 01/19859 



PCTAJSOO/25361 



-66- 

Arg He Asn Tyr Arg Gly Gly Val Ser Ser Ser Arg He Gly Gin Ala 

340 345 350 

Asn Leu Asn Gin Asn Phe Asn He Ser Thr Leu Phe Asn Pro Leu Gin 

355 360 365 

Thr Pro Phe He Arg Ser Trp Leu Asp Ser Gly Thr Asp Arg Glu Gly 

370 375 380 

Val Ala Thr Ser Thr Asn Trp Gin Ser Gly Ala Phe Glu Thr Thr Leu 
385 390 395 400 

Leu Arg Phe Ser He Phe Ser Ala Arg Gly Asn Ser Asn Phe Phe Pro 

405 410 415 

Asp Tyr Phe He Arg Asn He Ser Gly Val Val Gly Thr He Ser Asn 

420 425 430 

Ala Asp Leu Ala Arg Pro Leu His Phe Asn Glu He Arg Asp He Gly 

435 440 445 

Thr Thr Ala Val Ala Ser Leu Val Thr Val His Asn Arg Lys Asn Asn 

450 455 460 

He Tvr Asp Thr His Glu Asn Gly Thr Met He His Leu Ala Pro Asn 
465 470 475 480 

ASP Tyr Thr Gly Phe Thr Val Ser Pro He His Ala Thr Gin Val Asn 

485 490 495 

Asn Gin He Arg Thr Phe He Ser Glu Lys Tyr Gly Asn Gin Gly Asp 

500 505 510 

Ser Leu Arg Phe Glu Leu Ser Asn Thr Thr Ala Arg Tyr Thr Leu Arg 
515 520 525 

25 Gly Asn Gly Asn Ser Tyr Asn Leu Tyr Leu Arg Val Ser Ser He Gly 
530 535 540 

Ser Ser Thr He Arg Val Thr He Asn Gly Arg Val Tyr Thr Ala Asn 
545 550 555 560 

Val Asn Thr Thr Thr Asn Asn Asp Gly Val Leu Asp Asn Gly Ala Arg 

565 570 575 

Phe ser Asp He Asn He Gly Asn Val Val Ala Ser Ala Asn Thr Asn 

580 585 590 

val Pro Leu Asp He Gin Val Thr Phe Asn Asp Asn Pro Gin Phe Glu 
595 600 605 

35 Leu Met Asn He Met Leu Phe Gin Leu He Phe His His Phe He Lys 
610 615 620 

Val 
625 



10 



15 



20 



30 



40 



45 



<210> 49 
<211> 143 
<212> DNA 

<213> Bacillus thuringiensis 



<220> 

<221> modif ied_base 
<222> (8).. (140) 
50 <223> N = A, T, C or G 

<400> 49 



gtcgtganag gnccaggatt tacaggaggg gatatactnc gaagaacggn cggtggtgca 60 
tttggaacna ttagngctan ggctantgcc ccnttaacac aacaatatcg nataagatta i^u 
55 cgctntgctt ctacnacaan ttt 
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67 



10 



IS 



20 



25 



30 



3S 



40 



<210> 50 
<211> 47 
<212> PRT 

<213> Bacillus thuringiensis 
<220> 

<221> SITE 
<222> (3) 

<223> X = R, I, K, or T 

<220> 

<221> SITE 
<222> (17) 

<223> X = A, D, G or V 
<220> 

<22X> SITE 
<222> (25) 
<223> X = S, or R 

<220> 

<221> SITE 
<222> (27) 

<223> X = R, K, M, or T 
<220> 

<221> SITE 
<222> (29) 

<223> X = N, I, S, or T 

<220> 

<221> SITE 
<222> (42) 

<223> X = C, F, S, or y 
<220> 

<221> SITE 
<222> (47) 

<223> X = N, I, S, or T 



<400> 50 

Val Val Xaa Gly Pro Gly Phe Thr Gly Gly Asp lie Leu Arg Arg Thr 
15 10 15 

45 xaa Gly Gly Ala Phe Gly Thr lie Xaa Ala Xaa Ala Xaa Ala Pro Leu 
20 25 30 

Thr Gin Gin Tyr Arg lie Arg Leu Arg Xaa Ala Ser Thr Thr Xaa 
35 40 45 

SO 

<210> 51 
<211> 42 
<212> DNA 
55 <213> Artificial Sequence 

<220> 
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<223> Description of Artificial Sequence: Primer 
<400> 51 

tggatacttg atcaatatga taatccgtca catctgtttt ta 42 

5 

<210> 52 
<211> 61 
<212> DNA 
10 <213> Artificial Sequence 

<220> 

<:223> Description of Artificial Sequence: primer 
IS <400> 52 

agtaacggtg ttactattag cgagggcggt ccattcttta aggtcgtgca 'cttcagttag 60 
c " 



20 <210> 53 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



2S <220> 

<;223> Description of Artificial Sequence: primer 

<400> 53 

cgacttctcc tgctaatgga gg 

30 

<210> 54 

<211> 28 

<212> DNA 

35 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 

40 <400> 54 

ctcgctaata gtaacaccgt tacttgcc 



<210> 55 

45 <211> 61 

• <212> DNA 

<213> /Artificial Sequence 

<220> 

SO <223> Description of Artificial Sequence: primer 



<400> 55 

atttagtagc atgcgttgca ctttgtgcat tttttcataa gatgagtcat atgttttaaa 60 
t 



<210> 56 
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<211> 23 
<212> DHA 

<213> Artificial Sequence 
S <220> 

<223> Description of Artificial Sequence: primer 
<400> 56 

ggatagcact catcaaaggt acc 

to 

<210> 57 
<211> 22 
<212> DNA 
15 <213> Artificial Sequence 

\ 

' <220> 

<223> Description of Artificial Sequence: primer 

20 <400> 57 

gtwtggacsc rtcghgatgt gg 



<210> 58 
2S <211> 40 
<212> DNA 

<213> Artificial Sequence 

<220> 

30 <223> Description of Artificial Sequence: primer 
<400> 58 

taatttctgc tagcccwatt tctggattta attgttgatc 

35 

«210> 59 
<211> X9 
<212> DNA 

<213> Artificial Sequence 

40 

<220> 

<221> modif ied_base 
<222> (3) . . (12) 
<223> W = A, T 

45 

<220> 

<221> modif ied_base 
<222> (6) 

<223> N = A, C, T 

50 

<220> 

<221> modif ied_base 
<222> (10) 
<223> M = A, C 

55 

<220> 

<221> modif ied_base 
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23 



22 
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<222> (18) 
<223> R = A, G 

<220> 

<221> modif ied_base 
<222> (15) 
<223> D = A, G 



10 



<220> 

<223> Description of Artificial Sequence: primer 



<400> 59 

atwacncaam twccdttrg 



19 



IS 



20 



<210> 60 
<211> 17 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



2S 



<400> 60 

aatgcagatg aatgggg 



17 



30 



<210> 61 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



c220> 

c223> Description of Artificial Sequence: primer 



35 <400> 61 

tgataatgga gctcgtt 



17 



40 



45 



so 



55 



<210> 62 

<211> 3684 

<212> DNA 

<213> Baci 

<400> 62 

ttgacttcaa 

tcgaatcatt 

atagccgagg 

aacatagctg 

ttttatagtt 

ctagaacatg 

cttgctcgat 

tggctagaaa 

ttagaacttg 

ttattaatgg 

ctttttggta 

gtggaaaaaa 

aatttgagag 

acgctaggag 



llus thuringiensis 



ataggaaaaa 
ccgcacaaat 
ggaacaatat 
gtagaatact 
ttcttgttgg 
tcgaacatct 
tacaaggttt 
accgtgatga 
attttcttaa 
tatatgctca 
gtgaatttgg 
cgagagaata 
ggacaaatgc 
tattagatct 



tgagaatgaa 
gaatctatca 
cgatccattt 
aggtgtatta 
tgaattatgg 
tataagacaa 
aggaaattcc 
tgcaagaacg 
tgcgatgccg 
agctgcaaat 
gcttacatcc 
ttctgattat 
tgaaagttgg 
agtggcacta 



attataaatg 
accgatgctc 
gttagcgcat 
ggcgtaccgt 
ccccgcggca 
caagtaacag 
tttagagcct 
agaagtgttc 
cttttcgcaa 
ttacacctat 
caagaaattc 
tgcgcaagat 
ttgcgatata 
ttcccaagct 



ctttatcgat 
gtattgagga 
caacagtcca 
ttgctggaca 
gagatccttg 
aaaatactag 
atcaacagtc 
tttataccca 
ttagaaacca 
tattattgag 
aacgttatta 
ggtataatac 
atcaattccg 
atgacacgcg 



tccagctgta 60 
tagcttgtgt 120 
aacgggtatt 180 
aatagctagt 240 
ggaaattttc 300 
ggatacggct 360 
acttgaagat 420 
atatatagcc 4 80 
agaagttcca 540 
agatgcctct 600 
tgagcgccaa 660 
gggtttaaat 720 
tagagactta 780 
tgtttatcca 840 
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atgaatacca gtgctcaatt aacaagagaa atttatacag atccaattgg gagaacaaat 900 
gcaccttcag gatttgcaag tacgaattgg tttaataata atgcaccatc gttttctgcc 960 
atagaggctg ccgttattag gcctccgcat ctacttgatt ttccagaaca gcttacaatt 1020 
ttcagcgtat taagtcgatg gagtaatact caatatatga attactgggt gggacataga 1080 
5 cttgaatcgc gaacaataag ggggtcatta agtacctgga cacacggaaa taccaatact 1140 
tctattaatc ctgtaacatt acagttcaca tctcgagacg tttatagaac agaatcattt 1200 
gcagggataa atatacttct aactactcct gtgaatggag taccttgggc tagatttaat 1260 
tggagaaatc ccctgaattc tcttagaggt agccttctct atactatagg gtatactgga 1320 
gtggggacac aactatttga ttcagaaact gaattaccac cagaaacaac agaacgacca 1380 
10 aattatgaat cttacagtca tagattatct aatataagac taatatcagg aaacactttg 1440 
agagcaccag tatattcttg gacgcaccgt agtgcagatc gtacaaatac cattagttca 1500 
gatagcataa cacaaatacc attggtaaaa tcattcaacc ttaattcagg tacctctgta 1560 
gtcagtggcc caggatttac aggaggggat ataatccgaa ctaacgttaa tggtagtgta 1620 
ctaagtatgg gtcttaattt taataataca tcattacagc ggtatcgcgt gagagttcgt 1680 
15 tatgctgctt ctcaaacaat ggtcctgagg gtaactgtcg gagggagtac tacttttgat 174 0 
caaggattcc ctagtactat gagtgcaaat gagtctttga catctcaatc atttagattt 1800 
gcagaatttc ctgtaggtat tagtgcatct ggcagtcaaa ctgctggaat aagtataagt 1860 
aataatgcag gtagacaaac gtttcacttt gataaaattg aattcattcc aattactgca 1920 
accttcgaag cagaatatga tttagaaaga gcgcaagagg cggtgaatgc tctgtttact 1980 
20 aatacgaatc caagaaggtt gaaaacaggt gtgacagatt atcatattga tgaagtatcc 2 04 0 
aatttagtgg cgtgtttatc ggatgaattc tgcttggatg aaaagagaga attacttgag 2100 
aaagtgaaat atgcgaaacg actcagtgat gaaagaaact tactccaaga tccaaacttc 2160 
acatccatca ataagcaacc agacttcaat tctaataatg agcaatcgaa tttcacatct 2220 
atccatgaac aaCctgaaca tggatggtgg ggaagtgaga acattacaat ccaggaagga 2280 
25 aatgacgtat ttaaagagaa ttacgtcaca ctaccgggta cttttaatga gtgttatccg 2340 
acgtatttat atcaaaaaat aggggaggcg gaattaaaag cttatactcg ctaccaatta 2400 
agtggctata ttgaagatag tcaagattta gagatatatt tgattcgtta caatgcgaaa 2460 
catgaaacat tggatgttcc aggtaccgag tccgtatggc cgctttcagt tgaaagccca 2520 
atcggaaggt gcggagaacc gaatcgatgc gcaccacatt ttgaatggaa tcctgatcta 2580 
30- gattgttcct gcagagatgg agaaaaatgt gcgcatcatt cccatcattt ctctttggat 2640 
attgatgttg gatgcataga cttgcatgag aacctaggcg tgtgggtggt attcaagatt 2700 
aagacgcagg aaggtcatgc aagactaggg aacctggaat ttattgaaga gaaaccatta 2760 
ttaggagaag cactgtctcg tgtgaagaga gcagagaaaa aatggagaga caaacgtgaa 2820 
aaactacaat tggaaacaaa acgagtatat acagaggcaa aagaagctgt ggatgcttta 2880 
35 tttgtagatt ctcaatatga tagattacaa gcggatacaa acattggcat gattcatgcg 2940 
gcagataaac ttgttcatcg aattcgagag gcgtatcttt cagaattatc tgttatccca 3000 
ggtgtaaatg cggaaatttt tgaagaatta gaaggtcgca ttatcactgc aatctcccta 3060 
tacgatgcga gaaatgtcgt taaaaatggt gattttaata atggattagc atgctggaat 3120 
gtaaaagggc atgtagatgt acaacagagc catcaccgtt ctgtccttgt tatcccagaa 3180 
40 tgggaagcag aagtgtcaca agcagttcgc gtctgtccgg ggcgtggcta tatcctccgt 3240 
gtcacagcgt acaaagaggg atatggagag ggttgtgtaa ctatccatga aatcgagaac 3300 
aatacagacg aactaaaatt taaaaactgt gaagaagagg aagtgtatcc aacggataca 3360 
ggaacgtgta atgattatac tgcacaccaa ggtacagcag tatgtaattc ccgtaatgct 3420 
ggatatgagg atgcatatga agttgatact acagcatctg ttaattacaa accgacttat 3480 
45 gaagaagaaa cgtatacaga tgtacgaaga gataatcatt gtgaatatga cagagggtat 3540 
gtgaattatc caccagtacc agctggttat atgacaaaag aattagaata cttcccagaa 3600 
accgataagg tatggattga gattggagaa acggaaggga agtttattgt agacagcgtg 3660 
gaattactcc ttatggagga atag 3684 

50 

<210> 63 
<211> 1227 
<212> PRT 

<213> Bacillus thuringiensis 
55 <400> 63 

Leu Thr Ser Asn Arg Lys Asn Glu Asn Glu lie lie Asn Ala Leu Ser 
15 10 15 
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lie Pro Ala Val Ser Asn His Ser Ala Gin Met Asn Leu Ser Thr Asp 
20 25 30 

5 Ala Arg lie Glu Asp Ser Leu Cys He Ala Glu Gly Asn Asn He Asp 
35 40 45 

Pro Phe Val Ser Ala Ser Thr Val Gin Thr Gly He Asn He Ala Gly 
50 55 60 

Ara He Leu Gly Val Leu Gly Val Pro Phe Ala Gly Gin He Ala Ser 
65 70 75 80 

Phe Tyr Ser Phe Leu Val Gly Glu Leu Trp Pro Arg Gly Arg Asp Pro 
,5 85 -90 95 

Trp Glu He Phe Leu Glu His Val Glu His Leu He Arg Gin Gin Val 
100 105 110 

20 Thr Glu Asn Thr Arg Asp Thr Ala Leu Ala Arg Leu Gin Gly Leu Gly 

115 120 125 

Asn Ser Phe Arg Ala Tyr Gin Gin Ser Leu Glu Asp Trp Leu Glu Asn 
130 135 140 

Ara ASP Asp Ala Arg Thr Arg Ser Val Leu Tyr Thr Gin Tyr He Ala 
145 150 155 160 

Leu Glu Leu Asp Phe Leu Asn Ala Met Pro Leu Phe Ala He Arg Asn 
30 165 170 175 

Gin Glu Val Pro Leu Leu Met Val Tyr Ala Gin Ala Ala Asn Leu His 
180 185 190 

35 Leu Leu Leu Leu Arg Asp Ala Ser Leu Phe Gly Ser Glu Phe Gly Leu 
195 200 205 

Thr Ser Gin Glu He Gin Arg Tyr Tyr Glu Arg Gin Val Glu Lys Thr 
210 215 220 

40 

Arq Glu Tyr Ser Asp Tyr Cys Ala Arg Trp Tyr Asn Thr Gly Leu Asn 
230 235 240 



225 



Asn 



45 



Leu Arg Gly Thr Asn Ala Glu Ser Trp Leu Arg Tyr Asn Gin Phe 
245 250 255 



Arq Arq Asp Leu Thr Leu Gly Val Leu Asp Leu Val Ala Leu Phe Pro 
260 265 270 

50 ser Tyr Asp Thr Arg Val Tyr Pro Met Asn Thr Ser Ala Gin Leu Thr 
275 280 285 

Arg Glu He Tyr Thr Asp Pro He Gly Arg Thr Asn Ala Pro Ser Gly 
290 295 300 

Phe Ala Ser Thr Asn Trp Phe Asn Asn Asn Ala Pro Ser Phe Ser Ala 
305 310 315 320 
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lie Glu Ala Ala Val lie Arg Pro Pro His Leu Leu Asp Phe Pro Glu 
325 330 335 

5 Gin Leu Thr lie Phe Ser Val Leu Ser Arg Trp Ser Asn Thr Gin Tyr 
340 345 350 

Met Asn Tyr Trp Val Gly His Arg Leu Glu Ser Arg Thr He Arg Gly 
355 360 365 

10 

Ser Leu Ser Thr Trp Thr His Gly Asn Thr Asn Thr Ser He Asn Pro 
370 375 380 

Val Thr Leu Gin Phe Thr Ser Arg Asp Val Tyr Arg Thr Glu Ser Phe 
IS 385 390 395 400 

Ala Gly He Asn He Leu Leu Thr Thr Pro Val Asn Gly Val Pro Trp 
405 410 415 

20 Ala Arg Phe Asn Trp Arg Asn Pro Leu Asn Ser Leu Arg Gly Ser Leu 
420 425 430 

Leu Tyr Thr He Gly Tyr Thr Gly Val Gly Thr Gin Leu Phe Asp Ser 
435 440 445 

2S 

Glu Thr Glu Leu Pro Pro Glu Thr Thr Glu Arg Pro Asn Tyr Glu Ser 
450 455 460 

Tyr Ser His Arg Leu Ser Asn He Arg Leu He Ser Gly Asn Thr Leu 
30 465 470 475 480 

Arg Ala Pro Val Tyr Ser Trp Thr His Arg Ser Ala Asp Arg Thr Asn 
485 490 495 

35 Thr He Ser Ser Asp Ser He Thr Gin He Pro Leu Val Lys Ser Phe 
500 505 510 

Asn Leu Asn Ser Gly Thr Ser Val Val Ser Gly Pro Gly Phe Thr Gly 
515 520 525 

40 

Gly Asp He He Arg Thr Asn Val Asn Gly Ser Val Leu Ser Met Gly 
530 535 540 

Leu Asn Phe Asn Asn Thr Ser Leu Gin Arg Tyr Arg Val Arg Val Arg 
45 545 550 555 560 

Tyr Ala Ala Ser Gin Thr Met Val Leu Arg Val Thr Val Gly Gly Ser 
565 570 575 

50 Thr Thr Phe Asp Gin Gly Phe Pro Ser Thr Met Ser Ala Asn Glu Ser 
580 585 590 

Leu Thr Ser Gin Ser Phe Arg Phe Ala Glu Phe Pro Val Gly He Ser 
595 600 605 

55 

Ala Ser Gly Ser Gin Thr Ala Gly He Ser He Ser Asn Asn Ala Gly 
610 615 620 
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Arg Gin Thr Phe His Phe Asp Lys lie Glu Phe He Pro He Thr Ala 
625 630 635 640 

5 Thr Phe Glu Ala Glu Tyr Asp Leu Glu Arg Ala Gin Glu Ala Val Asn 
645 650 655 

Ala Leu Phe Thr Asn Thr Asn Pro Arg Arg Leu Lys Thr Gly Val Thr 
660 665 670 

10 

Asp Tyr His He Asp Glu Val Ser Asn Leu Val Ala Cys Leu Ser Asp 
675 680 685 

Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu Leu Glu Lys Val Lys Tyr 
15 690 695 700 

Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Pro Asn Phe 
705 710 715 720 

20 Thr Ser He Asn Lys Gin Pro Asp Phe Asn Ser Asn Asn Glu Gin Ser 
725 730 735 

Asn Phe Thr Ser He. His Glu Gin Ser Glu His Gly Trp Trp Gly Ser 
740 745 750 

25 

Glu Asn He Thr He Gin Glu Gly Asn Asp Val Phe Lys Glu Asn Tyr 
755 760 765 

Val Thr Leu Pro Gly Thr Phe Asn Glu Cys Tyr Pro Thr Tyr Leu Tyr 
30 770 775 780 

Gin Lys He Gly Glu TVla Glu Leu Lys Ala Tyr Thr Arg Tyr Gin Leu 
785 790 795 800 

35 Ser Gly Tyr He Glu Asp Ser Gin Asp Leu Glu He Tyr Leu He Arg 
805 810 815 

Tyr Asn Ala Lys His Glu Thr Leu Asp Val Pro Gly Thr Glu Ser Val 
820 825 830 

40 

Trp pro Leu Ser Val Glu Ser Pro He Gly Arg Cys Gly Glu Pro Asn 
835 840 845 

Arg Cys Ala Pro His Phe Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys 
45 850 855 860 

ArcT ASP Gly Glu Lys Cys Ala His His Ser His His Phe Ser Leu Asp 
865 870 875 880 

50 He Asp Val Gly Cys He Asp Leu His Glu Asn Leu Gly Val Trp Val 

885 890 895 

Val Phe Lys He Lys Thr Gin Glu Gly His Ala Arg Leu Gly Asn Leu 
900 905 910 

Glu Phe He Glu Glu Lys Pro Leu Leu Gly Glu Ala Leu Ser Arg Val 
915 920 925 
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Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Gin Leu 
930 935 940 

5 Glu Thr Lys Arg Val Tyr Thr Glu Ala Lys Glu Ala Val Asp Ala Leu 
945 950 955 960 

Phe Val Asp Ser Gin Tyr Asp Arg Leu Gin Ala Asp Thr Asn He Gly 
965 970 975 

10 

Met lie His Ala Ala Asp Lys Leu Val His Arg He Arg Glu Ala Tyr 
980 985 990 

Leu Ser Glu Leu Ser Val He Pro Gly Val Asn Ala Glu He Phe Glu 
IS 995 1000 1005 

Glu Leu Glu Gly Arg He He Thr Ala He Ser Leu Tyr Adp Ala Arg 
1010 1015 1020 

20 Asn Val Val Lys Asn Gly Asp Phe Asn Asn Gly Leu Ala Cys Trp Asn 
1025 1030 1035 1040 

Val Lys Gly His Val Asp Val Gin Gin Ser His His Arg Ser Val Leu 
1045 1050 1055 

25 

Val He Pro Glu Trp Glu Ala Glu Val Ser Gin Ala Val Arg Val Cys 
1060 1065 1070 

Pro Gly Arg Gly Tyr He Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr 
30 1075 1080 1085 

Gly Glu Gly Cys Val Thr He His Glu He Glu Asn Asn Thr Asp Glu 
1090 1095 1100 

35 Leu Lys Phe Lys Asn Cys Glu Glu Glu Glu Val Tyr Pro Thr Asp Thr 
1105 1110 1115 1120 

Gly Thr Cys Asn Asp Tyr Thr Ala His Gin Gly Thr Ala Val Cys Asn 
1125 1130 1135 

40 

Ser Arg Asn Ala Gly Tyr Glu Asp Ala Tyr Glu Val Asp Thr Thr Ala 
1140 1145 1150 

Ser Val Asn Tyr Lys Pro Thr Tyr Glu Glu Glu Thr Tyr Thr Asp Val 
45 1155 1160 1165 

Arg Arg Asp Asn His Cys Glu Tyr Asp Arg Gly Tyr Val Asn Tyr Pro 
1170 1175 1180 

50 Pro Val Pro Ala Gly Tyr Met Thr Lys Glu Leu Glu Tyr Phe Pro Glu 
1185 1190 1195 1200 

Thr Asp Lys Val Trp He Glu He Gly Glu Thr Glu Gly Lys Phe He 
1205 1210 1215 

55 

Val Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
1220 1225 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article I7(2)(a) tor the following reasons: 
because they relate to subject matter not required to be searched by this Authority, namely: 



^' ^ bS^they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



^" O bS^^ey are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box 



II Observations where unity of invention is lacking (Continuation of Item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 

1.1 — I As all required additional search fees were timely paid by the applicant, this Intemational Search Report covers all 
I — > searchable claims. 

2 n AS all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
' — of any additional fee. 

3 r~l AS only some of the required additional search fees were timely paid by the applicant, this Intemational Search Report 

■ LXJ covers only those claims for which fees were paid, specifically claims Nos.: 

1-11, 14-51 all partially (Inventions 1,6,10 searched) 

4 rn No required additional search fees were timely paid by the applicant. Consequently, this Intemational Search Report is 

■ ' — ' restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest Q The additional search fees were accompanied by the applicant's protest. 

No protest accompanied the payment of additional search tees. 

Form PCT/ISA/210 (continuation of first sheet (1)) (July 1998) 
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International Application No. PCTAJS 00 ^5361 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-11, 14-51 all partially 

Isolated polypeptide, polynucleotide encoding it and 
composition containing it; vector, transformed host cell, 
plant and its progeny and seeds containing the 
polynucleotide; methods of controlling Lepdopteran insects 
and for preparing insect resistant plants using the above 
polypeptide and polynucleotide; the polynucleotide and 
polypeptide being Seq Id Nos 17 and 18, respectively. 



2. Claims: 1-11, 14-51 all partially 

Subject matter as defined for invention 1 above, but limited 
to Seq Id Nos 19 and 20. 

3. Claims: 1-11, 14-51 all partially 

Subject matter as defined for invention 1 above, but limited 
to Seq Id Nos 23 and 24. 

4. Claims: 1-11, 14-51 all partially 

Subject matter as defined for invention 1 above, but limited 
to Seq Id Nos 39 and 40. 

5. Claims: 1-11, 14-51 all partially 

Subject matter as defined for invention 1 above, but limited 
to Seq Id Nos 43 and 44. 

6. Claims: 2-11, 15-51 all partially 

Subject matter as defined for invention 1 above, but limited 
to Seq Id Nos 1 and 2. 

7. Claims: 2-11, 15-51 all partially 

Subject matter as defined for invention 1 above, but limited 
to Seq Id Nos 5 and 6. 

8. Claims: 2-11, 15-51 all partially 

Subject matter as defined for invention 1 above, but limited 
to Seq Id Nos 7 and 8. 
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9. Claims: 2-11, 15-51 all partially 






Subject matter as defined for invention 1 
to Seq Id Nos 9 and 10. 


above, but limited 


1 1 n 


naims- 2-11. 15-51 all partially 


■ 




Subject matter as defined for invention 1 
to Seq Id Nos 37 and 38. 


above, but limited 


1 1 1 
1 


naimc- 3-11 15-51 all partially 






Subject matter as defined for invention 1 
to Seq Id Nos 15 and 16. 


above, but limited 


1 


naimc- 3-11 15-51 all partially 






Subject matter as defined for invention ] 
to Seq Id Nos 21 and 22. 


L above, but limited 


1 1 1 
1 10 . 


naims- 3-11, 15-51 all partially 






Subject matter as defined for invention 
to Seq Id Nos 27 and 28. 


1 above, but limited 


1 14 


Claims: 3-11, 15-51 all partially 






Subject matter as defined for invention 
to Seq Id Nos 35 and 36. 


1 above, but limited 


1 1 R 
1 


Claims- 3-11, 15-51 all partially 






Subject matter as defined for invention 
to Seq Id Nos 49 and 50. 


1 above, but limited 


16 


Claims: 4-11, 16-51 all partially 






' Subject matter as defined for invention 
to Seq Id Nos 3 and 4. 


1 above, but limited 


17 


. Claims: 4-11, 16-51 all partially 






Subject matter as defined for invention 
to Seq Id Nos 11 and 12. 


1 above, but limited 
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18 


Claims: 4-11, 16-51 all partially 








Subject matter as defined for invention 1 above, 
to Seq Id Nos 13 and 14. 


but 


limited 


19 


Claims: 4-11, 16-51 all partially 








Subject matter as defined for invention 1 above, 
to Seq Id Nos 25 and 26. 


but 


limited 


20. 


Claims: 4-11, 16-51 all partially 








Subject matter as defined for invention 1 above, 
to Seq Id Nos 29 and 30. 


but 


limited 


21. 


Claims: 4-11, 16-51 all partially 








Subject matter as defined for invention 1 above, 
to Seq Id Nos 31 and 32. 


but 


1 1 mi ted 


22. 


Claims: 4-11, 16-51 all partially 








Subject matter as defined for Invention 1 above, 
to Seq Id Nos 33 and 34. 


but 


limited 


23. 


Claims: 4-11, 16-51 all partially 








Subject matter as defined for invention 1 above, 
to Seq Id Nos 41 and 42. 


but 


1 1 mi ted 


24. 


Claims: 4-11, 15-51 all partially 








Subject matter as defined for invention 1 above, 
to Seq Id Nos 45 and 46. 


but 


limited 


25. 


Claims: 4-11, 16-51 all partially 








Subject matter as defined for Invention 1 above, 
to Seq Id Nos 47 and 48. 


but 


limited 


26. 


Claims: 4-11, 15-51 all partially 








Subject matter as defined for invention 1 above, 
to Seq Id Nos 62 and 63. 


but 


limited 


27. 


Claims: 12, 13 both partially 
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A Bacillus thuringiensis cell and a" ^"secticidal 
polypeptide prepared from it, where the cell is NRRL B-21784. 

28. CI aims: 12, 13 both partially 

Subject matter as defined for invention 27 above, but 
limited to NRRL B-21783. 

29. Claims: 12, 13 both partially 

Subject matter as defined for invention 27 above, but 
limited to NRRL B-21917. 

30. Claims: 12, 13 both partially 

Subject matter as defined for invention 27 above, but 
limited to NRRL B-21786 

31. Claims: 12, 13 both partially 

Subject matter as defined for invention 27 above, but 
limited to NRRL B-21787. 

32. Claims: 12, 13 both partially 

Subject matter as defined for invention 27 above, but 
limited to NRRL B-21785. 

33. Claims: 12, 13 both partially 

Subject matter as defined for invention 27 above, but 
limited to NRRL B-21788. 

34. Claims: 12, 13 both partially 

Subject matter as defined for invention 27 above, but 
limited to NRRL B-21915. 

35. Claims: 12, 13 both partially 

Subject matter as defined for invention 27 above, but 
limited to NRRL B-21916. 
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